Overview
Brought to you by YData
Dataset statistics
| Number of variables | 42 |
|---|---|
| Number of observations | 196294 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 122 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 64.4 MiB |
| Average record size in memory | 344.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 33 |
| Text | 1 |
| Dataset has 122 (0.1%) duplicate rows | Duplicates |
age is highly overall correlated with detailed_household_and_family_stat and 2 other fields | High correlation |
citizenship is highly overall correlated with country_of_birth_father and 2 other fields | High correlation |
class_of_worker is highly overall correlated with detailed_industry_recode and 4 other fields | High correlation |
country_of_birth_father is highly overall correlated with citizenship and 3 other fields | High correlation |
country_of_birth_mother is highly overall correlated with citizenship and 3 other fields | High correlation |
country_of_birth_self is highly overall correlated with citizenship and 2 other fields | High correlation |
detailed_household_and_family_stat is highly overall correlated with age and 4 other fields | High correlation |
detailed_household_summary_in_household is highly overall correlated with detailed_household_and_family_stat and 3 other fields | High correlation |
detailed_industry_recode is highly overall correlated with class_of_worker and 2 other fields | High correlation |
detailed_occupation_recode is highly overall correlated with class_of_worker and 2 other fields | High correlation |
education is highly overall correlated with tax_filer_stat and 1 other fields | High correlation |
family_members_under_18 is highly overall correlated with detailed_household_and_family_stat and 3 other fields | High correlation |
fill_inc_questionnaire_for_veteran's_admin is highly overall correlated with veterans_benefits | High correlation |
full_or_part_time_employment_stat is highly overall correlated with live_in_this_house_1_year_ago and 2 other fields | High correlation |
hispanic_origin is highly overall correlated with country_of_birth_father and 1 other fields | High correlation |
live_in_this_house_1_year_ago is highly overall correlated with full_or_part_time_employment_stat and 6 other fields | High correlation |
major_industry_code is highly overall correlated with class_of_worker and 3 other fields | High correlation |
major_occupation_code is highly overall correlated with class_of_worker and 3 other fields | High correlation |
marital_stat is highly overall correlated with tax_filer_stat | High correlation |
migration_code_change_in_msa is highly overall correlated with live_in_this_house_1_year_ago and 5 other fields | High correlation |
migration_code_change_in_reg is highly overall correlated with full_or_part_time_employment_stat and 4 other fields | High correlation |
migration_code_move_within_reg is highly overall correlated with live_in_this_house_1_year_ago and 5 other fields | High correlation |
migration_prev_res_in_sunbelt is highly overall correlated with live_in_this_house_1_year_ago and 3 other fields | High correlation |
num_persons_worked_for_employer is highly overall correlated with class_of_worker and 2 other fields | High correlation |
region_of_previous_residence is highly overall correlated with live_in_this_house_1_year_ago and 3 other fields | High correlation |
tax_filer_stat is highly overall correlated with age and 8 other fields | High correlation |
veterans_benefits is highly overall correlated with age and 6 other fields | High correlation |
weeks_worked_in_year is highly overall correlated with num_persons_worked_for_employer and 1 other fields | High correlation |
year is highly overall correlated with full_or_part_time_employment_stat and 4 other fields | High correlation |
enroll_in_edu_inst_last_wk is highly imbalanced (74.4%) | Imbalance |
race is highly imbalanced (62.0%) | Imbalance |
hispanic_origin is highly imbalanced (71.5%) | Imbalance |
member_of_a_labor_union is highly imbalanced (67.1%) | Imbalance |
reason_for_unemployment is highly imbalanced (89.3%) | Imbalance |
region_of_previous_residence is highly imbalanced (77.9%) | Imbalance |
migration_code_move_within_reg is highly imbalanced (54.4%) | Imbalance |
migration_prev_res_in_sunbelt is highly imbalanced (69.8%) | Imbalance |
family_members_under_18 is highly imbalanced (50.4%) | Imbalance |
country_of_birth_father is highly imbalanced (70.7%) | Imbalance |
country_of_birth_mother is highly imbalanced (71.4%) | Imbalance |
country_of_birth_self is highly imbalanced (81.6%) | Imbalance |
citizenship is highly imbalanced (65.3%) | Imbalance |
own_business_or_self_employed is highly imbalanced (67.6%) | Imbalance |
fill_inc_questionnaire_for_veteran's_admin is highly imbalanced (94.4%) | Imbalance |
target is highly imbalanced (66.0%) | Imbalance |
dividends_from_stocks is highly skewed (γ1 = 27.56720148) | Skewed |
age has 2643 (1.3%) zeros | Zeros |
wage_per_hour has 184991 (94.2%) zeros | Zeros |
capital_gains has 188915 (96.2%) zeros | Zeros |
capital_losses has 192388 (98.0%) zeros | Zeros |
dividends_from_stocks has 175156 (89.2%) zeros | Zeros |
num_persons_worked_for_employer has 92770 (47.3%) zeros | Zeros |
weeks_worked_in_year has 92770 (47.3%) zeros | Zeros |
Reproduction
| Analysis started | 2025-01-20 00:36:58.941612 |
|---|---|
| Analysis finished | 2025-01-20 00:37:53.710621 |
| Duration | 54.77 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
age
Real number (ℝ)
High correlation  Zeros 
| Distinct | 91 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.929468 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 2643 |
| Zeros (%) | 1.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 16 |
| median | 34 |
| Q3 | 50 |
| 95-th percentile | 75 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 22.210001 |
|---|---|
| Coefficient of variation (CV) | 0.63585282 |
| Kurtosis | -0.72745803 |
| Mean | 34.929468 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.35720878 |
| Sum | 6856445 |
| Variance | 493.28413 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34 | 3486 | 1.8% |
| 35 | 3450 | 1.8% |
| 36 | 3352 | 1.7% |
| 31 | 3349 | 1.7% |
| 33 | 3340 | 1.7% |
| 37 | 3278 | 1.7% |
| 38 | 3277 | 1.7% |
| 30 | 3202 | 1.6% |
| 32 | 3187 | 1.6% |
| 39 | 3144 | 1.6% |
| Other values (81) | 163229 |
| Value | Count | Frequency (%) |
| 0 | 2643 | |
| 1 | 2954 | |
| 2 | 3031 | |
| 3 | 3059 | |
| 4 | 3108 | |
| 5 | 3090 | |
| 6 | 3014 | |
| 7 | 2980 | |
| 8 | 3004 | |
| 9 | 2941 |
| Value | Count | Frequency (%) |
| 90 | 722 | |
| 89 | 195 | 0.1% |
| 88 | 241 | 0.1% |
| 87 | 301 | |
| 86 | 348 | |
| 85 | 423 | |
| 84 | 519 | |
| 83 | 561 | |
| 82 | 614 | |
| 81 | 718 |
class_of_worker
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| Private sector | |
| Government | |
| Self-employed | |
| Not employed | 603 |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 14.124186 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Self-employed |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 97029 | |
| Private sector | 72021 | |
| Government | 14935 | 7.6% |
| Self-employed | 11706 | 6.0% |
| Not employed | 603 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 97632 | |
| in | 97029 | |
| universe | 97029 | |
| private | 72021 | |
| sector | 72021 | |
| government | 14935 | 3.2% |
| self-employed | 11706 | 2.5% |
| employed | 603 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 404294 | |
| 266682 | ||
| i | 266079 | |
| t | 256609 | |
| r | 256006 | |
| n | 223928 | |
| o | 196897 | |
| v | 183985 | |
| s | 169050 | 6.1% |
| N | 97632 | 3.5% |
| Other values (13) | 451331 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2297811 | |
| Space Separator | 266682 | 9.6% |
| Uppercase Letter | 196294 | 7.1% |
| Dash Punctuation | 11706 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 404294 | |
| i | 266079 | |
| t | 256609 | |
| r | 256006 | |
| n | 223928 | |
| o | 196897 | |
| v | 183985 | |
| s | 169050 | |
| u | 97029 | 4.2% |
| c | 72021 | 3.1% |
| Other values (7) | 171913 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97632 | |
| P | 72021 | |
| G | 14935 | 7.6% |
| S | 11706 | 6.0% |
Space Separator
| Value | Count | Frequency (%) |
| 266682 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11706 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2494105 | |
| Common | 278388 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 404294 | |
| i | 266079 | |
| t | 256609 | |
| r | 256006 | |
| n | 223928 | |
| o | 196897 | |
| v | 183985 | |
| s | 169050 | |
| N | 97632 | 3.9% |
| u | 97029 | 3.9% |
| Other values (11) | 342596 |
Common
| Value | Count | Frequency (%) |
| 266682 | ||
| - | 11706 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2772493 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 404294 | |
| 266682 | ||
| i | 266079 | |
| t | 256609 | |
| r | 256006 | |
| n | 223928 | |
| o | 196897 | |
| v | 183985 | |
| s | 169050 | 6.1% |
| N | 97632 | 3.5% |
| Other values (13) | 451331 |
detailed_industry_recode
Categorical
High correlation 
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe or children | |
|---|---|
| Public administration | |
| Manufacturing | 9367 |
| Manufacturing-durable goods | 5984 |
| Business and repair services | 5973 |
| Other values (37) |
Length
| Max length | 58 |
|---|---|
| Median length | 27 |
| Mean length | 24.836225 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe or children |
|---|---|
| 2nd row | Manufacturing-durable goods |
| 3rd row | Not in universe or children |
| 4th row | Not in universe or children |
| 5th row | Not in universe or children |
Common Values
| Value | Count | Frequency (%) |
| Not in universe or children | 97467 | |
| Public administration | 17521 | 8.9% |
| Manufacturing | 9367 | 4.8% |
| Manufacturing-durable goods | 5984 | 3.0% |
| Business and repair services | 5973 | 3.0% |
| Public administration and armed forces | 4683 | 2.4% |
| Wholesale and retail trade | 4648 | 2.4% |
| Professional services | 4616 | 2.4% |
| Trade | 4482 | 2.3% |
| Professional and related services | 3889 | 2.0% |
| Other values (32) | 37664 | 19.2% |
Length
| Value | Count | Frequency (%) |
| not | 99171 | |
| or | 97467 | |
| children | 97467 | |
| in | 97467 | |
| universe | 97467 | |
| services | 27917 | 3.8% |
| and | 27058 | 3.7% |
| public | 23275 | 3.2% |
| administration | 22204 | 3.0% |
| trade | 10605 | 1.5% |
| Other values (45) | 131263 |
Most occurring characters
| Value | Count | Frequency (%) |
| 535067 | ||
| i | 514356 | |
| n | 480741 | |
| e | 473536 | 9.7% |
| r | 467280 | 9.6% |
| o | 294803 | 6.0% |
| s | 272236 | 5.6% |
| t | 227261 | 4.7% |
| a | 210547 | 4.3% |
| u | 204908 | 4.2% |
| Other values (29) | 1194467 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4111773 | |
| Space Separator | 535067 | 11.0% |
| Uppercase Letter | 199151 | 4.1% |
| Other Punctuation | 16529 | 0.3% |
| Dash Punctuation | 12682 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 514356 | |
| n | 480741 | |
| e | 473536 | |
| r | 467280 | |
| o | 294803 | 7.2% |
| s | 272236 | 6.6% |
| t | 227261 | 5.5% |
| a | 210547 | 5.1% |
| u | 204908 | 5.0% |
| c | 197282 | 4.8% |
| Other values (11) | 768823 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 99171 | |
| P | 35429 | 17.8% |
| M | 25814 | 13.0% |
| B | 8910 | 4.5% |
| T | 8654 | 4.3% |
| W | 6592 | 3.3% |
| H | 4045 | 2.0% |
| F | 2119 | 1.1% |
| E | 2077 | 1.0% |
| S | 1644 | 0.8% |
| Other values (5) | 4696 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| 535067 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 16529 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12682 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4310924 | |
| Common | 564278 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 514356 | |
| n | 480741 | |
| e | 473536 | |
| r | 467280 | |
| o | 294803 | 6.8% |
| s | 272236 | 6.3% |
| t | 227261 | 5.3% |
| a | 210547 | 4.9% |
| u | 204908 | 4.8% |
| c | 197282 | 4.6% |
| Other values (26) | 967974 |
Common
| Value | Count | Frequency (%) |
| 535067 | ||
| , | 16529 | 2.9% |
| - | 12682 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4875202 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 535067 | ||
| i | 514356 | |
| n | 480741 | |
| e | 473536 | 9.7% |
| r | 467280 | 9.6% |
| o | 294803 | 6.0% |
| s | 272236 | 5.6% |
| t | 227261 | 4.7% |
| a | 210547 | 4.3% |
| u | 204908 | 4.2% |
| Other values (29) | 1194467 |
detailed_occupation_recode
Categorical
High correlation 
| Distinct | 47 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| Other executive, admin and managerial | 8756 |
| Food service occupations | 7886 |
| Computer equipment operators | 5412 |
| Personal service occupations | 5105 |
| Other values (42) |
Length
| Max length | 46 |
|---|---|
| Median length | 43 |
| Mean length | 23.385646 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Automobile mechanics and repairers |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 97467 | |
| Other executive, admin and managerial | 8756 | 4.5% |
| Food service occupations | 7886 | 4.0% |
| Computer equipment operators | 5412 | 2.8% |
| Personal service occupations | 5105 | 2.6% |
| Construction trades | 4144 | 2.1% |
| Automobile mechanics and repairers | 4025 | 2.1% |
| Teachers, except college and university | 3683 | 1.9% |
| Supervisors and proprietors, sales occupations | 3445 | 1.8% |
| Other administrative support occupations | 3392 | 1.7% |
| Other values (37) | 52979 |
Length
| Value | Count | Frequency (%) |
| not | 97467 | |
| universe | 97467 | |
| in | 97467 | |
| occupations | 47831 | 7.3% |
| and | 46816 | 7.1% |
| other | 21968 | 3.3% |
| service | 18046 | 2.7% |
| operators | 10243 | 1.6% |
| related | 9882 | 1.5% |
| admin | 9300 | 1.4% |
| Other values (83) | 200943 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 502225 | |
| 461136 | ||
| i | 419088 | 9.1% |
| n | 410204 | 8.9% |
| o | 330710 | 7.2% |
| t | 318313 | 6.9% |
| r | 316394 | 6.9% |
| s | 309366 | 6.7% |
| a | 258923 | 5.6% |
| u | 201716 | 4.4% |
| Other values (30) | 1062387 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3912476 | |
| Space Separator | 461136 | 10.0% |
| Uppercase Letter | 196294 | 4.3% |
| Other Punctuation | 20556 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 502225 | |
| i | 419088 | |
| n | 410204 | |
| o | 330710 | |
| t | 318313 | |
| r | 316394 | |
| s | 309366 | |
| a | 258923 | 6.6% |
| u | 201716 | 5.2% |
| c | 194338 | 5.0% |
| Other values (15) | 651199 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97908 | |
| O | 22512 | 11.5% |
| F | 17128 | 8.7% |
| C | 12809 | 6.5% |
| P | 12314 | 6.3% |
| M | 7395 | 3.8% |
| S | 5287 | 2.7% |
| H | 4933 | 2.5% |
| E | 4529 | 2.3% |
| T | 4421 | 2.3% |
| Other values (3) | 7058 | 3.6% |
Space Separator
| Value | Count | Frequency (%) |
| 461136 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 20556 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4108770 | |
| Common | 481692 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 502225 | |
| i | 419088 | |
| n | 410204 | |
| o | 330710 | 8.0% |
| t | 318313 | 7.7% |
| r | 316394 | 7.7% |
| s | 309366 | 7.5% |
| a | 258923 | 6.3% |
| u | 201716 | 4.9% |
| c | 194338 | 4.7% |
| Other values (28) | 847493 |
Common
| Value | Count | Frequency (%) |
| 461136 | ||
| , | 20556 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4590462 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 502225 | |
| 461136 | ||
| i | 419088 | 9.1% |
| n | 410204 | 8.9% |
| o | 330710 | 7.2% |
| t | 318313 | 6.9% |
| r | 316394 | 6.9% |
| s | 309366 | 6.7% |
| a | 258923 | 5.6% |
| u | 201716 | 4.4% |
| Other values (30) | 1062387 |
education
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| High School Graduate | |
|---|---|
| Children | |
| Some College | |
| Below High School | |
| College Graduate |
Length
| Max length | 20 |
|---|---|
| Median length | 16 |
| Mean length | 14.551112 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | High School Graduate |
|---|---|
| 2nd row | Some College |
| 3rd row | Below High School |
| 4th row | Children |
| 5th row | Children |
Common Values
| Value | Count | Frequency (%) |
| High School Graduate | 48374 | |
| Children | 44347 | |
| Some College | 37530 | |
| Below High School | 36588 | |
| College Graduate | 19859 | |
| Advanced Degree | 9596 | 4.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| high | 84962 | |
| school | 84962 | |
| graduate | 68233 | |
| college | 57389 | |
| children | 44347 | |
| some | 37530 | |
| below | 36588 | |
| advanced | 9596 | 2.2% |
| degree | 9596 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 339860 | |
| o | 301431 | 10.6% |
| l | 280675 | 9.8% |
| 236909 | 8.3% | |
| h | 214271 | 7.5% |
| g | 151947 | 5.3% |
| a | 146062 | 5.1% |
| d | 131772 | 4.6% |
| i | 129309 | 4.5% |
| S | 122492 | 4.3% |
| Other values (14) | 801568 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2186184 | |
| Uppercase Letter | 433203 | 15.2% |
| Space Separator | 236909 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 339860 | |
| o | 301431 | |
| l | 280675 | |
| h | 214271 | |
| g | 151947 | |
| a | 146062 | |
| d | 131772 | 6.0% |
| i | 129309 | 5.9% |
| r | 122176 | 5.6% |
| c | 94558 | 4.3% |
| Other values (6) | 274123 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 122492 | |
| C | 101736 | |
| H | 84962 | |
| G | 68233 | |
| B | 36588 | 8.4% |
| A | 9596 | 2.2% |
| D | 9596 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 236909 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2619387 | |
| Common | 236909 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 339860 | |
| o | 301431 | |
| l | 280675 | |
| h | 214271 | 8.2% |
| g | 151947 | 5.8% |
| a | 146062 | 5.6% |
| d | 131772 | 5.0% |
| i | 129309 | 4.9% |
| S | 122492 | 4.7% |
| r | 122176 | 4.7% |
| Other values (13) | 679392 |
Common
| Value | Count | Frequency (%) |
| 236909 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2856296 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 339860 | |
| o | 301431 | 10.6% |
| l | 280675 | 9.8% |
| 236909 | 8.3% | |
| h | 214271 | 7.5% |
| g | 151947 | 5.3% |
| a | 146062 | 5.1% |
| d | 131772 | 4.6% |
| i | 129309 | 4.5% |
| S | 122492 | 4.3% |
| Other values (14) | 801568 |
wage_per_hour
Real number (ℝ)
Zeros 
| Distinct | 1240 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 56.336505 |
| Minimum | 0 |
|---|---|
| Maximum | 9999 |
| Zeros | 184991 |
| Zeros (%) | 94.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 500 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 277.05433 |
|---|---|
| Coefficient of variation (CV) | 4.9178473 |
| Kurtosis | 152.75307 |
| Mean | 56.336505 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.8617868 |
| Sum | 11058518 |
| Variance | 76759.103 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 184991 | |
| 500 | 734 | 0.4% |
| 600 | 546 | 0.3% |
| 700 | 534 | 0.3% |
| 800 | 507 | 0.3% |
| 1000 | 386 | 0.2% |
| 425 | 375 | 0.2% |
| 900 | 336 | 0.2% |
| 550 | 280 | 0.1% |
| 1200 | 256 | 0.1% |
| Other values (1230) | 7349 | 3.7% |
| Value | Count | Frequency (%) |
| 0 | 184991 | |
| 20 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 75 | 2 | < 0.1% |
| 100 | 11 | < 0.1% |
| 110 | 1 | < 0.1% |
| 125 | 1 | < 0.1% |
| 135 | 1 | < 0.1% |
| 143 | 1 | < 0.1% |
| 150 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999 | 1 | < 0.1% |
| 9916 | 1 | < 0.1% |
| 9800 | 2 | |
| 9400 | 2 | |
| 9000 | 1 | < 0.1% |
| 8800 | 1 | < 0.1% |
| 8600 | 1 | < 0.1% |
| 8500 | 1 | < 0.1% |
| 8300 | 1 | < 0.1% |
| 8000 | 4 |
enroll_in_edu_inst_last_wk
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| High school | 6853 |
| College or university | 5679 |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 16.033939 |
| Min length | 12 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | High school |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 183762 | |
| High school | 6853 | 3.5% |
| College or university | 5679 | 2.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 183762 | |
| in | 183762 | |
| universe | 183762 | |
| high | 6853 | 1.2% |
| school | 6853 | 1.2% |
| college | 5679 | 1.0% |
| or | 5679 | 1.0% |
| university | 5679 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 582029 | ||
| i | 385735 | |
| e | 384561 | |
| n | 373203 | |
| o | 208826 | 6.6% |
| s | 196294 | 6.2% |
| r | 195120 | 6.2% |
| v | 189441 | 6.0% |
| u | 189441 | 6.0% |
| t | 189441 | 6.0% |
| Other values (8) | 253275 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2369043 | |
| Space Separator | 582029 | 18.5% |
| Uppercase Letter | 196294 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 385735 | |
| e | 384561 | |
| n | 373203 | |
| o | 208826 | |
| s | 196294 | |
| r | 195120 | |
| v | 189441 | |
| u | 189441 | |
| t | 189441 | |
| l | 18211 | 0.8% |
| Other values (4) | 38770 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 183762 | |
| H | 6853 | 3.5% |
| C | 5679 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 582029 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2565337 | |
| Common | 582029 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 385735 | |
| e | 384561 | |
| n | 373203 | |
| o | 208826 | |
| s | 196294 | |
| r | 195120 | |
| v | 189441 | |
| u | 189441 | |
| t | 189441 | |
| N | 183762 | |
| Other values (7) | 69513 | 2.7% |
Common
| Value | Count | Frequency (%) |
| 582029 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3147366 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 582029 | ||
| i | 385735 | |
| e | 384561 | |
| n | 373203 | |
| o | 208826 | 6.6% |
| s | 196294 | 6.2% |
| r | 195120 | 6.2% |
| v | 189441 | 6.0% |
| u | 189441 | 6.0% |
| t | 189441 | 6.0% |
| Other values (8) | 253275 |
marital_stat
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Married | |
|---|---|
| Never Married | |
| Divorced | |
| Widowed | |
| Separated | 3459 |
Length
| Max length | 21 |
|---|---|
| Median length | 13 |
| Mean length | 9.7542309 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Widowed |
|---|---|
| 2nd row | Divorced |
| 3rd row | Never Married |
| 4th row | Never Married |
| 5th row | Never Married |
Common Values
| Value | Count | Frequency (%) |
| Married | 84859 | |
| Never Married | 83296 | |
| Divorced | 12707 | 6.5% |
| Widowed | 10456 | 5.3% |
| Separated | 3459 | 1.8% |
| Married-spouse absent | 1517 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| married | 168155 | |
| never | 83296 | |
| divorced | 12707 | 4.5% |
| widowed | 10456 | 3.7% |
| separated | 3459 | 1.2% |
| married-spouse | 1517 | 0.5% |
| absent | 1517 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 438806 | |
| e | 369379 | |
| d | 206750 | |
| i | 192835 | |
| a | 178107 | |
| M | 169672 | 8.9% |
| v | 96003 | 5.0% |
| 84813 | 4.4% | |
| N | 83296 | 4.4% |
| o | 24680 | 1.3% |
| Other values (12) | 70356 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1548777 | |
| Uppercase Letter | 279590 | 14.6% |
| Space Separator | 84813 | 4.4% |
| Dash Punctuation | 1517 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 438806 | |
| e | 369379 | |
| d | 206750 | |
| i | 192835 | |
| a | 178107 | |
| v | 96003 | 6.2% |
| o | 24680 | 1.6% |
| c | 12707 | 0.8% |
| w | 10456 | 0.7% |
| p | 4976 | 0.3% |
| Other values (5) | 14078 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 169672 | |
| N | 83296 | |
| D | 12707 | 4.5% |
| W | 10456 | 3.7% |
| S | 3459 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 84813 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1517 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1828367 | |
| Common | 86330 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 438806 | |
| e | 369379 | |
| d | 206750 | |
| i | 192835 | |
| a | 178107 | |
| M | 169672 | 9.3% |
| v | 96003 | 5.3% |
| N | 83296 | 4.6% |
| o | 24680 | 1.3% |
| c | 12707 | 0.7% |
| Other values (10) | 56132 | 3.1% |
Common
| Value | Count | Frequency (%) |
| 84813 | ||
| - | 1517 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1914697 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 438806 | |
| e | 369379 | |
| d | 206750 | |
| i | 192835 | |
| a | 178107 | |
| M | 169672 | 8.9% |
| v | 96003 | 5.0% |
| 84813 | 4.4% | |
| N | 83296 | 4.4% |
| o | 24680 | 1.3% |
| Other values (12) | 70356 | 3.7% |
major_industry_code
Categorical
High correlation 
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe or children | |
|---|---|
| Retail trade | |
| Manufacturing-durable goods | 9014 |
| Education | 8283 |
| Manufacturing-nondurable goods | 6895 |
| Other values (19) |
Length
| Max length | 36 |
|---|---|
| Median length | 28 |
| Mean length | 24.337417 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe or children |
|---|---|
| 2nd row | Construction |
| 3rd row | Not in universe or children |
| 4th row | Not in universe or children |
| 5th row | Not in universe or children |
Common Values
| Value | Count | Frequency (%) |
| Not in universe or children | 97467 | |
| Retail trade | 17069 | 8.7% |
| Manufacturing-durable goods | 9014 | 4.6% |
| Education | 8283 | 4.2% |
| Manufacturing-nondurable goods | 6895 | 3.5% |
| Finance insurance and real estate | 6145 | 3.1% |
| Construction | 5984 | 3.0% |
| Business and repair services | 5651 | 2.9% |
| Medical except hospital | 4683 | 2.4% |
| Public administration | 4610 | 2.3% |
| Other values (14) | 30493 | 15.5% |
Length
| Value | Count | Frequency (%) |
| not | 97467 | |
| universe | 97467 | |
| or | 97467 | |
| children | 97467 | |
| in | 97467 | |
| services | 21704 | 3.1% |
| trade | 20663 | 2.9% |
| retail | 17069 | 2.4% |
| goods | 15909 | 2.2% |
| and | 13160 | 1.9% |
| Other values (34) | 135458 |
Most occurring characters
| Value | Count | Frequency (%) |
| 711298 | ||
| e | 483445 | |
| i | 445075 | 9.3% |
| n | 436324 | 9.1% |
| r | 434473 | 9.1% |
| o | 298089 | 6.2% |
| t | 238790 | 5.0% |
| s | 230048 | 4.8% |
| a | 190730 | 4.0% |
| c | 185335 | 3.9% |
| Other values (28) | 1123682 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3847878 | |
| Space Separator | 711298 | 14.9% |
| Uppercase Letter | 202204 | 4.2% |
| Dash Punctuation | 15909 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 483445 | |
| i | 445075 | |
| n | 436324 | |
| r | 434473 | |
| o | 298089 | |
| t | 238790 | 6.2% |
| s | 230048 | 6.0% |
| a | 190730 | 5.0% |
| c | 185335 | 4.8% |
| u | 184035 | 4.8% |
| Other values (11) | 721534 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97467 | |
| M | 21155 | 10.5% |
| R | 17069 | 8.4% |
| E | 9933 | 4.9% |
| H | 9838 | 4.9% |
| P | 8492 | 4.2% |
| C | 7165 | 3.5% |
| F | 6367 | 3.1% |
| B | 5651 | 2.8% |
| O | 4482 | 2.2% |
| Other values (5) | 14585 | 7.2% |
Space Separator
| Value | Count | Frequency (%) |
| 711298 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15909 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4050082 | |
| Common | 727207 | 15.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 483445 | |
| i | 445075 | |
| n | 436324 | |
| r | 434473 | |
| o | 298089 | 7.4% |
| t | 238790 | 5.9% |
| s | 230048 | 5.7% |
| a | 190730 | 4.7% |
| c | 185335 | 4.6% |
| u | 184035 | 4.5% |
| Other values (26) | 923738 |
Common
| Value | Count | Frequency (%) |
| 711298 | ||
| - | 15909 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4777289 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 711298 | ||
| e | 483445 | |
| i | 445075 | 9.3% |
| n | 436324 | 9.1% |
| r | 434473 | 9.1% |
| o | 298089 | 6.2% |
| t | 238790 | 5.0% |
| s | 230048 | 4.8% |
| a | 190730 | 4.0% |
| c | 185335 | 3.9% |
| Other values (28) | 1123682 |
major_occupation_code
Categorical
High correlation 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| Adm support including clerical | |
| Professional specialty | |
| Executive admin and managerial | |
| Other service | |
| Other values (10) |
Length
| Max length | 38 |
|---|---|
| Median length | 36 |
| Mean length | 20.842002 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Precision production craft & repair |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 97467 | |
| Adm support including clerical | 14836 | 7.6% |
| Professional specialty | 13940 | 7.1% |
| Executive admin and managerial | 12495 | 6.4% |
| Other service | 12097 | 6.2% |
| Sales | 11781 | 6.0% |
| Precision production craft & repair | 10517 | 5.4% |
| Machine operators assmblrs & inspctrs | 6377 | 3.2% |
| Handlers equip cleaners etc | 4126 | 2.1% |
| Transportation and material moving | 4020 | 2.0% |
| Other values (5) | 8638 | 4.4% |
Length
| Value | Count | Frequency (%) |
| not | 97467 | |
| in | 97467 | |
| universe | 97467 | |
| and | 22676 | 3.7% |
| support | 17854 | 2.9% |
| 16894 | 2.8% | |
| clerical | 14836 | 2.4% |
| adm | 14836 | 2.4% |
| including | 14836 | 2.4% |
| professional | 13940 | 2.3% |
| Other values (33) | 204739 |
Most occurring characters
| Value | Count | Frequency (%) |
| 617138 | ||
| i | 408259 | |
| e | 403678 | |
| n | 352634 | 8.6% |
| r | 296592 | 7.2% |
| s | 257072 | 6.3% |
| t | 214090 | 5.2% |
| o | 205966 | 5.0% |
| a | 201609 | 4.9% |
| u | 158075 | 3.9% |
| Other values (24) | 976047 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3260798 | |
| Space Separator | 617138 | 15.1% |
| Uppercase Letter | 196330 | 4.8% |
| Other Punctuation | 16894 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 408259 | |
| e | 403678 | |
| n | 352634 | |
| r | 296592 | |
| s | 257072 | |
| t | 214090 | 6.6% |
| o | 205966 | 6.3% |
| a | 201609 | 6.2% |
| u | 158075 | 4.8% |
| c | 145771 | 4.5% |
| Other values (12) | 617052 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 97467 | |
| P | 26898 | 13.7% |
| A | 14872 | 7.6% |
| E | 12495 | 6.4% |
| O | 12097 | 6.2% |
| S | 11781 | 6.0% |
| T | 7038 | 3.6% |
| M | 6377 | 3.2% |
| H | 4126 | 2.1% |
| F | 3179 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 617138 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 16894 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3457128 | |
| Common | 634032 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 408259 | |
| e | 403678 | |
| n | 352634 | |
| r | 296592 | 8.6% |
| s | 257072 | 7.4% |
| t | 214090 | 6.2% |
| o | 205966 | 6.0% |
| a | 201609 | 5.8% |
| u | 158075 | 4.6% |
| c | 145771 | 4.2% |
| Other values (22) | 813382 |
Common
| Value | Count | Frequency (%) |
| 617138 | ||
| & | 16894 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4091160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 617138 | ||
| i | 408259 | |
| e | 403678 | |
| n | 352634 | 8.6% |
| r | 296592 | 7.2% |
| s | 257072 | 6.3% |
| t | 214090 | 5.2% |
| o | 205966 | 5.0% |
| a | 201609 | 4.9% |
| u | 158075 | 3.9% |
| Other values (24) | 976047 |
race
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| White | |
|---|---|
| Black | |
| Asian or Pacific Islander | 5821 |
| Other | 3645 |
| Amer Indian Aleut or Eskimo | 2242 |
Length
| Max length | 28 |
|---|---|
| Median length | 6 |
| Mean length | 6.8443661 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | White |
| 3rd row | Asian or Pacific Islander |
| 4th row | White |
| 5th row | White |
Common Values
| Value | Count | Frequency (%) |
| White | 164380 | |
| Black | 20206 | 10.3% |
| Asian or Pacific Islander | 5821 | 3.0% |
| Other | 3645 | 1.9% |
| Amer Indian Aleut or Eskimo | 2242 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 164380 | |
| black | 20206 | 9.1% |
| or | 8063 | 3.6% |
| asian | 5821 | 2.6% |
| pacific | 5821 | 2.6% |
| islander | 5821 | 2.6% |
| other | 3645 | 1.6% |
| amer | 2242 | 1.0% |
| indian | 2242 | 1.0% |
| aleut | 2242 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 222725 | ||
| i | 186327 | |
| e | 178330 | |
| t | 170267 | |
| h | 168025 | |
| W | 164380 | |
| a | 39911 | 3.0% |
| c | 31848 | 2.4% |
| l | 28269 | 2.1% |
| k | 22448 | 1.7% |
| Other values (14) | 130978 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 906121 | |
| Space Separator | 222725 | 16.6% |
| Uppercase Letter | 214662 | 16.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 186327 | |
| e | 178330 | |
| t | 170267 | |
| h | 168025 | |
| a | 39911 | 4.4% |
| c | 31848 | 3.5% |
| l | 28269 | 3.1% |
| k | 22448 | 2.5% |
| r | 19771 | 2.2% |
| n | 16126 | 1.8% |
| Other values (6) | 44799 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 164380 | |
| B | 20206 | 9.4% |
| A | 10305 | 4.8% |
| I | 8063 | 3.8% |
| P | 5821 | 2.7% |
| O | 3645 | 1.7% |
| E | 2242 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 222725 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1120783 | |
| Common | 222725 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 186327 | |
| e | 178330 | |
| t | 170267 | |
| h | 168025 | |
| W | 164380 | |
| a | 39911 | 3.6% |
| c | 31848 | 2.8% |
| l | 28269 | 2.5% |
| k | 22448 | 2.0% |
| B | 20206 | 1.8% |
| Other values (13) | 110772 |
Common
| Value | Count | Frequency (%) |
| 222725 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1343508 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 222725 | ||
| i | 186327 | |
| e | 178330 | |
| t | 170267 | |
| h | 168025 | |
| W | 164380 | |
| a | 39911 | 3.0% |
| c | 31848 | 2.4% |
| l | 28269 | 2.1% |
| k | 22448 | 1.7% |
| Other values (14) | 130978 |
hispanic_origin
Categorical
High correlation  Imbalance 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| All other | |
|---|---|
| Mexican-American | 8008 |
| Mexican (Mexicano) | 7210 |
| Central or South American | 3891 |
| Puerto Rican | 3306 |
| Other values (5) | 5076 |
Length
| Max length | 26 |
|---|---|
| Median length | 10 |
| Mean length | 10.980417 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | All other |
|---|---|
| 2nd row | All other |
| 3rd row | All other |
| 4th row | All other |
| 5th row | All other |
Common Values
| Value | Count | Frequency (%) |
| All other | 168803 | |
| Mexican-American | 8008 | 4.1% |
| Mexican (Mexicano) | 7210 | 3.7% |
| Central or South American | 3891 | 2.0% |
| Puerto Rican | 3306 | 1.7% |
| Other Spanish | 2476 | 1.3% |
| Cuban | 1122 | 0.6% |
| NA | 870 | 0.4% |
| Do not know | 305 | 0.2% |
| Chicano | 303 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| other | 171279 | |
| all | 168803 | |
| mexican-american | 8008 | 2.1% |
| mexican | 7210 | 1.8% |
| mexicano | 7210 | 1.8% |
| central | 3891 | 1.0% |
| or | 3891 | 1.0% |
| south | 3891 | 1.0% |
| american | 3891 | 1.0% |
| rican | 3306 | 0.8% |
| Other values (8) | 8992 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 390372 | ||
| l | 341497 | |
| e | 212803 | |
| r | 194266 | |
| o | 188319 | |
| t | 182672 | |
| A | 181572 | |
| h | 177949 | |
| n | 46035 | 2.1% |
| a | 45425 | 2.1% |
| Other values (21) | 194480 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1516644 | |
| Space Separator | 390372 | 18.1% |
| Uppercase Letter | 225946 | 10.5% |
| Dash Punctuation | 8008 | 0.4% |
| Open Punctuation | 7210 | 0.3% |
| Close Punctuation | 7210 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 341497 | |
| e | 212803 | |
| r | 194266 | |
| o | 188319 | |
| t | 182672 | |
| h | 177949 | |
| n | 46035 | 3.0% |
| a | 45425 | 3.0% |
| i | 40412 | 2.7% |
| c | 37936 | 2.5% |
| Other values (8) | 49330 | 3.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 181572 | |
| M | 22428 | 9.9% |
| S | 6367 | 2.8% |
| C | 5316 | 2.4% |
| P | 3306 | 1.5% |
| R | 3306 | 1.5% |
| O | 2476 | 1.1% |
| N | 870 | 0.4% |
| D | 305 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 390372 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8008 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 7210 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 7210 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1742590 | |
| Common | 412800 | 19.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 341497 | |
| e | 212803 | |
| r | 194266 | |
| o | 188319 | |
| t | 182672 | |
| A | 181572 | |
| h | 177949 | |
| n | 46035 | 2.6% |
| a | 45425 | 2.6% |
| i | 40412 | 2.3% |
| Other values (17) | 131640 | 7.6% |
Common
| Value | Count | Frequency (%) |
| 390372 | ||
| - | 8008 | 1.9% |
| ( | 7210 | 1.7% |
| ) | 7210 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2155390 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 390372 | ||
| l | 341497 | |
| e | 212803 | |
| r | 194266 | |
| o | 188319 | |
| t | 182672 | |
| A | 181572 | |
| h | 177949 | |
| n | 46035 | 2.1% |
| a | 45425 | 2.1% |
| Other values (21) | 194480 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.043333 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Male |
| 3rd row | Female |
| 4th row | Female |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Female | 102400 | |
| Male | 93894 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 102400 | |
| male | 93894 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 298694 | |
| 196294 | ||
| a | 196294 | |
| l | 196294 | |
| F | 102400 | 8.6% |
| m | 102400 | 8.6% |
| M | 93894 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 793682 | |
| Space Separator | 196294 | 16.5% |
| Uppercase Letter | 196294 | 16.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 298694 | |
| a | 196294 | |
| l | 196294 | |
| m | 102400 | 12.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 102400 | |
| M | 93894 |
Space Separator
| Value | Count | Frequency (%) |
| 196294 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 989976 | |
| Common | 196294 | 16.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 298694 | |
| a | 196294 | |
| l | 196294 | |
| F | 102400 | 10.3% |
| m | 102400 | 10.3% |
| M | 93894 | 9.5% |
Common
| Value | Count | Frequency (%) |
| 196294 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1186270 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 298694 | |
| 196294 | ||
| a | 196294 | |
| l | 196294 | |
| F | 102400 | 8.6% |
| m | 102400 | 8.6% |
| M | 93894 | 7.9% |
member_of_a_labor_union
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| No | 16032 |
| Yes | 3030 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 14.753013 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 177232 | |
| No | 16032 | 8.2% |
| Yes | 3030 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 177232 | |
| in | 177232 | |
| universe | 177232 | |
| no | 16032 | 2.9% |
| yes | 3030 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 550758 | ||
| e | 357494 | |
| i | 354464 | |
| n | 354464 | |
| N | 193264 | 6.7% |
| o | 193264 | 6.7% |
| s | 180262 | 6.2% |
| t | 177232 | 6.1% |
| u | 177232 | 6.1% |
| v | 177232 | 6.1% |
| Other values (2) | 180262 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2148876 | |
| Space Separator | 550758 | 19.0% |
| Uppercase Letter | 196294 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 357494 | |
| i | 354464 | |
| n | 354464 | |
| o | 193264 | |
| s | 180262 | |
| t | 177232 | |
| u | 177232 | |
| v | 177232 | |
| r | 177232 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 193264 | |
| Y | 3030 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 550758 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2345170 | |
| Common | 550758 | 19.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 357494 | |
| i | 354464 | |
| n | 354464 | |
| N | 193264 | |
| o | 193264 | |
| s | 180262 | |
| t | 177232 | |
| u | 177232 | |
| v | 177232 | |
| r | 177232 |
Common
| Value | Count | Frequency (%) |
| 550758 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2895928 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 550758 | ||
| e | 357494 | |
| i | 354464 | |
| n | 354464 | |
| N | 193264 | 6.7% |
| o | 193264 | 6.7% |
| s | 180262 | 6.2% |
| t | 177232 | 6.1% |
| u | 177232 | 6.1% |
| v | 177232 | 6.1% |
| Other values (2) | 180262 | 6.2% |
reason_for_unemployment
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| Job loser | 3014 |
| Re-entrant | 2018 |
| Job leaver | 598 |
| New entrant | 438 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.832313 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 190226 | |
| Job loser | 3014 | 1.5% |
| Re-entrant | 2018 | 1.0% |
| Job leaver | 598 | 0.3% |
| New entrant | 438 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 190226 | |
| in | 190226 | |
| universe | 190226 | |
| job | 3612 | 0.6% |
| loser | 3014 | 0.5% |
| re-entrant | 2018 | 0.3% |
| leaver | 598 | 0.1% |
| new | 438 | 0.1% |
| entrant | 438 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 389574 | |
| n | 385364 | |
| 384502 | ||
| i | 380452 | |
| o | 196852 | |
| r | 196294 | |
| t | 195138 | |
| s | 193240 | |
| v | 190824 | |
| N | 190664 | |
| Other values (8) | 208590 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2328680 | |
| Space Separator | 384502 | 13.2% |
| Uppercase Letter | 196294 | 6.7% |
| Dash Punctuation | 2018 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 389574 | |
| n | 385364 | |
| i | 380452 | |
| o | 196852 | |
| r | 196294 | |
| t | 195138 | |
| s | 193240 | |
| v | 190824 | |
| u | 190226 | |
| b | 3612 | 0.2% |
| Other values (3) | 7104 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 190664 | |
| J | 3612 | 1.8% |
| R | 2018 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 384502 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2018 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2524974 | |
| Common | 386520 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 389574 | |
| n | 385364 | |
| i | 380452 | |
| o | 196852 | |
| r | 196294 | |
| t | 195138 | |
| s | 193240 | |
| v | 190824 | |
| N | 190664 | |
| u | 190226 | |
| Other values (6) | 16346 | 0.6% |
Common
| Value | Count | Frequency (%) |
| 384502 | ||
| - | 2018 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2911494 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 389574 | |
| n | 385364 | |
| 384502 | ||
| i | 380452 | |
| o | 196852 | |
| r | 196294 | |
| t | 195138 | |
| s | 193240 | |
| v | 190824 | |
| N | 190664 | |
| Other values (8) | 208590 |
full_or_part_time_employment_stat
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Children or Armed Forces | |
|---|---|
| FTE | |
| Not Employed | |
| PTE | 5898 |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 17.130875 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not Employed |
|---|---|
| 2nd row | Children or Armed Forces |
| 3rd row | Not Employed |
| 4th row | Children or Armed Forces |
| 5th row | Children or Armed Forces |
Common Values
| Value | Count | Frequency (%) |
| Children or Armed Forces | 120632 | |
| FTE | 43038 | 21.9% |
| Not Employed | 26726 | 13.6% |
| PTE | 5898 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| children | 120632 | |
| or | 120632 | |
| armed | 120632 | |
| forces | 120632 | |
| fte | 43038 | 7.4% |
| not | 26726 | 4.6% |
| employed | 26726 | 4.6% |
| pte | 5898 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 482528 | |
| e | 388622 | |
| 388622 | ||
| o | 294716 | 8.8% |
| d | 267990 | 8.0% |
| F | 163670 | 4.9% |
| m | 147358 | 4.4% |
| l | 147358 | 4.4% |
| h | 120632 | 3.6% |
| s | 120632 | 3.6% |
| Other values (12) | 840560 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2411910 | |
| Uppercase Letter | 562156 | 16.7% |
| Space Separator | 388622 | 11.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 482528 | |
| e | 388622 | |
| o | 294716 | |
| d | 267990 | |
| m | 147358 | 6.1% |
| l | 147358 | 6.1% |
| h | 120632 | 5.0% |
| s | 120632 | 5.0% |
| c | 120632 | 5.0% |
| n | 120632 | 5.0% |
| Other values (4) | 200810 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 163670 | |
| C | 120632 | |
| A | 120632 | |
| E | 75662 | |
| T | 48936 | 8.7% |
| N | 26726 | 4.8% |
| P | 5898 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 388622 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2974066 | |
| Common | 388622 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 482528 | |
| e | 388622 | |
| o | 294716 | |
| d | 267990 | 9.0% |
| F | 163670 | 5.5% |
| m | 147358 | 5.0% |
| l | 147358 | 5.0% |
| h | 120632 | 4.1% |
| s | 120632 | 4.1% |
| c | 120632 | 4.1% |
| Other values (11) | 719928 |
Common
| Value | Count | Frequency (%) |
| 388622 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3362688 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 482528 | |
| e | 388622 | |
| 388622 | ||
| o | 294716 | 8.8% |
| d | 267990 | 8.0% |
| F | 163670 | 4.9% |
| m | 147358 | 4.4% |
| l | 147358 | 4.4% |
| h | 120632 | 3.6% |
| s | 120632 | 3.6% |
| Other values (12) | 840560 |
capital_gains
Real number (ℝ)
Zeros 
| Distinct | 132 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 441.87004 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 188915 |
| Zeros (%) | 96.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4735.677 |
|---|---|
| Coefficient of variation (CV) | 10.717353 |
| Kurtosis | 386.64929 |
| Mean | 441.87004 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.835992 |
| Sum | 86736437 |
| Variance | 22426637 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 188915 | |
| 15024 | 788 | 0.4% |
| 7688 | 609 | 0.3% |
| 7298 | 582 | 0.3% |
| 99999 | 390 | 0.2% |
| 3103 | 237 | 0.1% |
| 5178 | 207 | 0.1% |
| 5013 | 158 | 0.1% |
| 4386 | 151 | 0.1% |
| 3325 | 121 | 0.1% |
| Other values (122) | 4136 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 188915 | |
| 114 | 11 | < 0.1% |
| 401 | 33 | < 0.1% |
| 594 | 88 | < 0.1% |
| 914 | 17 | < 0.1% |
| 991 | 59 | < 0.1% |
| 1055 | 69 | < 0.1% |
| 1086 | 81 | < 0.1% |
| 1090 | 2 | < 0.1% |
| 1111 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 390 | |
| 41310 | 2 | < 0.1% |
| 34095 | 11 | < 0.1% |
| 27828 | 94 | < 0.1% |
| 25236 | 23 | < 0.1% |
| 25124 | 18 | < 0.1% |
| 22040 | 2 | < 0.1% |
| 20051 | 91 | < 0.1% |
| 18481 | 14 | < 0.1% |
| 15831 | 16 | < 0.1% |
capital_losses
Real number (ℝ)
Zeros 
| Distinct | 113 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.927593 |
| Minimum | 0 |
|---|---|
| Maximum | 4608 |
| Zeros | 192388 |
| Zeros (%) | 98.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4608 |
| Range | 4608 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 274.08117 |
|---|---|
| Coefficient of variation (CV) | 7.226432 |
| Kurtosis | 60.558557 |
| Mean | 37.927593 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.567395 |
| Sum | 7444959 |
| Variance | 75120.49 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 192388 | |
| 1902 | 407 | 0.2% |
| 1977 | 381 | 0.2% |
| 1887 | 364 | 0.2% |
| 1602 | 193 | 0.1% |
| 2415 | 122 | 0.1% |
| 1485 | 95 | < 0.1% |
| 1848 | 88 | < 0.1% |
| 1876 | 87 | < 0.1% |
| 1672 | 85 | < 0.1% |
| Other values (103) | 2084 | 1.1% |
| Value | Count | Frequency (%) |
| 0 | 192388 | |
| 155 | 1 | < 0.1% |
| 213 | 10 | < 0.1% |
| 323 | 10 | < 0.1% |
| 419 | 29 | < 0.1% |
| 625 | 25 | < 0.1% |
| 653 | 7 | < 0.1% |
| 772 | 5 | < 0.1% |
| 810 | 5 | < 0.1% |
| 880 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 4608 | 4 | < 0.1% |
| 4356 | 30 | |
| 3900 | 2 | < 0.1% |
| 3770 | 5 | < 0.1% |
| 3683 | 4 | < 0.1% |
| 3500 | 10 | < 0.1% |
| 3175 | 8 | < 0.1% |
| 3004 | 11 | < 0.1% |
| 2824 | 27 | |
| 2788 | 7 | < 0.1% |
dividends_from_stocks
Real number (ℝ)
Skewed  Zeros 
| Distinct | 1478 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200.72239 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 175156 |
| Zeros (%) | 89.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 400 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2000.1306 |
|---|---|
| Coefficient of variation (CV) | 9.9646614 |
| Kurtosis | 1073.3032 |
| Mean | 200.72239 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 27.567201 |
| Sum | 39400600 |
| Variance | 4000522.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 175156 | |
| 100 | 1148 | 0.6% |
| 500 | 1030 | 0.5% |
| 1000 | 894 | 0.5% |
| 200 | 866 | 0.4% |
| 50 | 831 | 0.4% |
| 2000 | 574 | 0.3% |
| 250 | 555 | 0.3% |
| 150 | 549 | 0.3% |
| 300 | 523 | 0.3% |
| Other values (1468) | 14168 | 7.2% |
| Value | Count | Frequency (%) |
| 0 | 175156 | |
| 1 | 472 | 0.2% |
| 2 | 193 | 0.1% |
| 3 | 129 | 0.1% |
| 4 | 75 | < 0.1% |
| 5 | 179 | 0.1% |
| 6 | 100 | 0.1% |
| 7 | 93 | < 0.1% |
| 8 | 94 | < 0.1% |
| 9 | 56 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 25 | |
| 95095 | 1 | < 0.1% |
| 75000 | 5 | < 0.1% |
| 70000 | 3 | < 0.1% |
| 66621 | 2 | < 0.1% |
| 60000 | 7 | < 0.1% |
| 57678 | 1 | < 0.1% |
| 55000 | 1 | < 0.1% |
| 54600 | 2 | < 0.1% |
| 54500 | 2 | < 0.1% |
tax_filer_stat
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Joint Filer | |
|---|---|
| Non-Filer | |
| Individual Filer |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 11.409406 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Non-Filer |
|---|---|
| 2nd row | Individual Filer |
| 3rd row | Non-Filer |
| 4th row | Non-Filer |
| 5th row | Non-Filer |
Common Values
| Value | Count | Frequency (%) |
| Joint Filer | 79557 | |
| Non-Filer | 71903 | |
| Individual Filer | 44834 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| filer | 124391 | |
| joint | 79557 | |
| non-filer | 71903 | |
| individual | 44834 | 14.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 365519 | |
| l | 241128 | |
| e | 196294 | |
| r | 196294 | |
| n | 196294 | |
| F | 196294 | |
| o | 151460 | 6.8% |
| 124391 | 5.6% | |
| d | 89668 | 4.0% |
| J | 79557 | 3.6% |
| Other values (7) | 402699 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1650716 | |
| Uppercase Letter | 392588 | 17.5% |
| Space Separator | 124391 | 5.6% |
| Dash Punctuation | 71903 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 365519 | |
| l | 241128 | |
| e | 196294 | |
| r | 196294 | |
| n | 196294 | |
| o | 151460 | |
| d | 89668 | 5.4% |
| t | 79557 | 4.8% |
| v | 44834 | 2.7% |
| u | 44834 | 2.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 196294 | |
| J | 79557 | |
| N | 71903 | 18.3% |
| I | 44834 | 11.4% |
Space Separator
| Value | Count | Frequency (%) |
| 124391 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 71903 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2043304 | |
| Common | 196294 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 365519 | |
| l | 241128 | |
| e | 196294 | |
| r | 196294 | |
| n | 196294 | |
| F | 196294 | |
| o | 151460 | |
| d | 89668 | 4.4% |
| J | 79557 | 3.9% |
| t | 79557 | 3.9% |
| Other values (5) | 251239 |
Common
| Value | Count | Frequency (%) |
| 124391 | ||
| - | 71903 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2239598 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 365519 | |
| l | 241128 | |
| e | 196294 | |
| r | 196294 | |
| n | 196294 | |
| F | 196294 | |
| o | 151460 | 6.8% |
| 124391 | 5.6% | |
| d | 89668 | 4.0% |
| J | 79557 | 3.6% |
| Other values (7) | 402699 |
region_of_previous_residence
Categorical
High correlation  Imbalance 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| South | 4875 |
| West | 4068 |
| Midwest | 3559 |
| Northeast | 2700 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.271807 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | South |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 180562 | |
| South | 4875 | 2.5% |
| West | 4068 | 2.1% |
| Midwest | 3559 | 1.8% |
| Northeast | 2700 | 1.4% |
| Abroad | 530 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 180562 | |
| in | 180562 | |
| universe | 180562 | |
| south | 4875 | 0.9% |
| west | 4068 | 0.7% |
| midwest | 3559 | 0.6% |
| northeast | 2700 | 0.5% |
| abroad | 530 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 557418 | ||
| e | 371451 | |
| i | 364683 | |
| n | 361124 | |
| t | 198464 | 6.6% |
| s | 190889 | 6.4% |
| o | 188667 | 6.3% |
| u | 185437 | 6.2% |
| r | 183792 | 6.1% |
| N | 183262 | 6.1% |
| Other values (10) | 212577 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2244052 | |
| Space Separator | 557418 | 18.6% |
| Uppercase Letter | 196294 | 6.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 371451 | |
| i | 364683 | |
| n | 361124 | |
| t | 198464 | |
| s | 190889 | |
| o | 188667 | |
| u | 185437 | |
| r | 183792 | |
| v | 180562 | |
| h | 7575 | 0.3% |
| Other values (4) | 11408 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 183262 | |
| S | 4875 | 2.5% |
| W | 4068 | 2.1% |
| M | 3559 | 1.8% |
| A | 530 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 557418 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2440346 | |
| Common | 557418 | 18.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 371451 | |
| i | 364683 | |
| n | 361124 | |
| t | 198464 | |
| s | 190889 | |
| o | 188667 | |
| u | 185437 | |
| r | 183792 | |
| N | 183262 | |
| v | 180562 | |
| Other values (9) | 32015 | 1.3% |
Common
| Value | Count | Frequency (%) |
| 557418 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2997764 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 557418 | ||
| e | 371451 | |
| i | 364683 | |
| n | 361124 | |
| t | 198464 | 6.6% |
| s | 190889 | 6.4% |
| o | 188667 | 6.3% |
| u | 185437 | 6.2% |
| r | 183792 | 6.1% |
| N | 183262 | 6.1% |
| Other values (10) | 212577 | 7.1% |
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 16 |
| Mean length | 15.449194 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Arkansas |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
| Value | Count | Frequency (%) |
| not | 180562 | |
| universe | 180562 | |
| in | 180562 | |
| california | 1710 | 0.3% |
| north | 1307 | 0.2% |
| utah | 1061 | 0.2% |
| new | 974 | 0.2% |
| carolina | 905 | 0.2% |
| florida | 847 | 0.2% |
| 707 | 0.1% | |
| Other values (46) | 11192 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 560389 | ||
| i | 373914 | |
| n | 370809 | |
| e | 366790 | |
| o | 192226 | 6.3% |
| r | 188882 | 6.2% |
| s | 186123 | 6.1% |
| t | 186023 | 6.1% |
| N | 183194 | 6.0% |
| u | 181785 | 6.0% |
| Other values (36) | 242449 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2273043 | |
| Space Separator | 560389 | 18.5% |
| Uppercase Letter | 198445 | 6.5% |
| Other Punctuation | 707 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 373914 | |
| n | 370809 | |
| e | 366790 | |
| o | 192226 | |
| r | 188882 | |
| s | 186123 | |
| t | 186023 | |
| u | 181785 | |
| v | 180935 | |
| a | 18992 | 0.8% |
| Other values (14) | 26564 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 183194 | |
| C | 3084 | 1.6% |
| M | 2531 | 1.3% |
| A | 1623 | 0.8% |
| O | 1069 | 0.5% |
| U | 1061 | 0.5% |
| I | 927 | 0.5% |
| F | 847 | 0.4% |
| D | 821 | 0.4% |
| W | 577 | 0.3% |
| Other values (10) | 2711 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 560389 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 707 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2471488 | |
| Common | 561096 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 373914 | |
| n | 370809 | |
| e | 366790 | |
| o | 192226 | |
| r | 188882 | |
| s | 186123 | |
| t | 186023 | |
| N | 183194 | |
| u | 181785 | |
| v | 180935 | |
| Other values (34) | 60807 | 2.5% |
Common
| Value | Count | Frequency (%) |
| 560389 | ||
| ? | 707 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3032584 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 560389 | ||
| i | 373914 | |
| n | 370809 | |
| e | 366790 | |
| o | 192226 | 6.3% |
| r | 188882 | 6.2% |
| s | 186123 | 6.1% |
| t | 186023 | 6.1% |
| N | 183194 | 6.0% |
| u | 181785 | 6.0% |
| Other values (36) | 242449 |
detailed_household_and_family_stat
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Primary Householder | |
|---|---|
| Child | |
| Extended Family | 9646 |
| Other | 7041 |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 13.844376 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Extended Family |
|---|---|
| 2nd row | Primary Householder |
| 3rd row | Child |
| 4th row | Child |
| 5th row | Child |
Common Values
| Value | Count | Frequency (%) |
| Primary Householder | 117117 | |
| Child | 62490 | |
| Extended Family | 9646 | 4.9% |
| Other | 7041 | 3.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| primary | 117117 | |
| householder | 117117 | |
| child | 62490 | |
| extended | 9646 | 3.0% |
| family | 9646 | 3.0% |
| other | 7041 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 358392 | |
| e | 260567 | 9.6% |
| o | 234234 | 8.6% |
| d | 198899 | 7.3% |
| i | 189253 | 7.0% |
| l | 189253 | 7.0% |
| h | 186648 | 6.9% |
| m | 126763 | 4.7% |
| a | 126763 | 4.7% |
| y | 126763 | 4.7% |
| Other values (12) | 720033 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2267748 | |
| Uppercase Letter | 323057 | 11.9% |
| Space Separator | 126763 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 358392 | |
| e | 260567 | |
| o | 234234 | |
| d | 198899 | |
| i | 189253 | |
| l | 189253 | |
| h | 186648 | |
| m | 126763 | 5.6% |
| a | 126763 | 5.6% |
| y | 126763 | 5.6% |
| Other values (5) | 270213 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 117117 | |
| H | 117117 | |
| C | 62490 | |
| E | 9646 | 3.0% |
| F | 9646 | 3.0% |
| O | 7041 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 126763 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2590805 | |
| Common | 126763 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 358392 | |
| e | 260567 | |
| o | 234234 | 9.0% |
| d | 198899 | 7.7% |
| i | 189253 | 7.3% |
| l | 189253 | 7.3% |
| h | 186648 | 7.2% |
| m | 126763 | 4.9% |
| a | 126763 | 4.9% |
| y | 126763 | 4.9% |
| Other values (11) | 593270 |
Common
| Value | Count | Frequency (%) |
| 126763 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2717568 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 358392 | |
| e | 260567 | 9.6% |
| o | 234234 | 8.6% |
| d | 198899 | 7.3% |
| i | 189253 | 7.0% |
| l | 189253 | 7.0% |
| h | 186648 | 6.9% |
| m | 126763 | 4.7% |
| a | 126763 | 4.7% |
| y | 126763 | 4.7% |
| Other values (12) | 720033 |
detailed_household_summary_in_household
Categorical
High correlation 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Householder | |
|---|---|
| Child under 18 never married | |
| Spouse of householder | |
| Child 18 or older | |
| Other relative of householder | |
| Other values (3) |
Length
| Max length | 37 |
|---|---|
| Median length | 30 |
| Mean length | 20.147406 |
| Min length | 12 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Other relative of householder |
|---|---|
| 2nd row | Householder |
| 3rd row | Child 18 or older |
| 4th row | Child under 18 never married |
| 5th row | Child under 18 never married |
Common Values
| Value | Count | Frequency (%) |
| Householder | 75461 | |
| Child under 18 never married | 47318 | |
| Spouse of householder | 41684 | |
| Child 18 or older | 14416 | 7.3% |
| Other relative of householder | 9651 | 4.9% |
| Nonrelative of householder | 7585 | 3.9% |
| Group Quarters- Secondary individual | 132 | 0.1% |
| Child under 18 ever married | 47 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| householder | 134381 | |
| child | 61781 | |
| 18 | 61781 | |
| of | 58920 | |
| under | 47365 | 8.5% |
| married | 47365 | 8.5% |
| never | 47318 | 8.5% |
| spouse | 41684 | 7.5% |
| older | 14416 | 2.6% |
| or | 14416 | 2.6% |
| Other values (8) | 27462 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 558709 | |
| 556889 | ||
| o | 406047 | |
| r | 380088 | |
| d | 305704 | |
| h | 264733 | 6.7% |
| l | 227946 | 5.8% |
| u | 223826 | 5.7% |
| s | 176197 | 4.5% |
| i | 126778 | 3.2% |
| Other values (19) | 727898 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3077674 | |
| Space Separator | 556889 | 14.1% |
| Uppercase Letter | 196558 | 5.0% |
| Decimal Number | 123562 | 3.1% |
| Dash Punctuation | 132 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 558709 | |
| o | 406047 | |
| r | 380088 | |
| d | 305704 | |
| h | 264733 | |
| l | 227946 | |
| u | 223826 | |
| s | 176197 | 5.7% |
| i | 126778 | 4.1% |
| n | 102532 | 3.3% |
| Other values (8) | 305114 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 75461 | |
| C | 61781 | |
| S | 41816 | |
| O | 9651 | 4.9% |
| N | 7585 | 3.9% |
| G | 132 | 0.1% |
| Q | 132 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 61781 | |
| 1 | 61781 |
Space Separator
| Value | Count | Frequency (%) |
| 556889 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 132 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3274232 | |
| Common | 680583 | 17.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 558709 | |
| o | 406047 | |
| r | 380088 | |
| d | 305704 | |
| h | 264733 | |
| l | 227946 | |
| u | 223826 | |
| s | 176197 | 5.4% |
| i | 126778 | 3.9% |
| n | 102532 | 3.1% |
| Other values (15) | 501672 |
Common
| Value | Count | Frequency (%) |
| 556889 | ||
| 8 | 61781 | 9.1% |
| 1 | 61781 | 9.1% |
| - | 132 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3954815 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 558709 | |
| 556889 | ||
| o | 406047 | |
| r | 380088 | |
| d | 305704 | |
| h | 264733 | 6.7% |
| l | 227946 | 5.8% |
| u | 223826 | 5.7% |
| s | 176197 | 4.5% |
| i | 126778 | 3.2% |
| Other values (19) | 727898 |
instance_weight
Real number (ℝ)
| Distinct | 99800 |
|---|---|
| Distinct (%) | 50.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1743.2676 |
| Minimum | 37.87 |
|---|---|
| Maximum | 18656.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 37.87 |
|---|---|
| 5-th percentile | 394.9995 |
| Q1 | 1061.53 |
| median | 1620.175 |
| Q3 | 2194.06 |
| 95-th percentile | 3593.14 |
| Maximum | 18656.3 |
| Range | 18618.43 |
| Interquartile range (IQR) | 1132.53 |
Descriptive statistics
| Standard deviation | 996.94598 |
|---|---|
| Coefficient of variation (CV) | 0.57188351 |
| Kurtosis | 5.395758 |
| Mean | 1743.2676 |
| Median Absolute Deviation (MAD) | 564.265 |
| Skewness | 1.4314984 |
| Sum | 3.4219297 × 108 |
| Variance | 993901.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1601.4 | 32 | < 0.1% |
| 1191.21 | 32 | < 0.1% |
| 753.23 | 32 | < 0.1% |
| 707.9 | 31 | < 0.1% |
| 1787.34 | 31 | < 0.1% |
| 1317.51 | 31 | < 0.1% |
| 1070.15 | 30 | < 0.1% |
| 1033.83 | 28 | < 0.1% |
| 1002.02 | 28 | < 0.1% |
| 1839.19 | 28 | < 0.1% |
| Other values (99790) | 195991 |
| Value | Count | Frequency (%) |
| 37.87 | 1 | < 0.1% |
| 39.11 | 1 | < 0.1% |
| 40.67 | 2 | < 0.1% |
| 42.82 | 2 | < 0.1% |
| 43.26 | 3 | |
| 45.74 | 2 | < 0.1% |
| 47.83 | 6 | |
| 49.82 | 2 | < 0.1% |
| 52.43 | 1 | < 0.1% |
| 52.46 | 4 |
| Value | Count | Frequency (%) |
| 18656.3 | 1 | |
| 16349.2 | 1 | |
| 13911.5 | 1 | |
| 13145.1 | 1 | |
| 13114.2 | 1 | |
| 12960.2 | 1 | |
| 12399.9 | 1 | |
| 12184.5 | 1 | |
| 11958.4 | 1 | |
| 11863 | 1 |
migration_code_change_in_msa
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| No movement | |
| MSA movement | |
| Non-MSA movement | 2802 |
| Mixed movement | 1402 |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 13.187005 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | MSA movement |
| 3rd row | Not in universe |
| 4th row | No movement |
| 5th row | No movement |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 99864 | |
| No movement | 81128 | |
| MSA movement | 10572 | 5.4% |
| Non-MSA movement | 2802 | 1.4% |
| Mixed movement | 1402 | 0.7% |
| International | 526 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 99864 | |
| in | 99864 | |
| universe | 99864 | |
| movement | 95904 | |
| no | 81128 | |
| msa | 10572 | 2.1% |
| non-msa | 2802 | 0.6% |
| mixed | 1402 | 0.3% |
| international | 526 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 393464 | |
| n | 300012 | |
| 295632 | ||
| o | 280224 | |
| i | 201656 | |
| t | 196820 | |
| v | 195768 | |
| m | 191808 | |
| N | 183794 | |
| r | 100390 | 3.9% |
| Other values (11) | 248962 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2064252 | |
| Space Separator | 295632 | 11.4% |
| Uppercase Letter | 225844 | 8.7% |
| Dash Punctuation | 2802 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 393464 | |
| n | 300012 | |
| o | 280224 | |
| i | 201656 | |
| t | 196820 | |
| v | 195768 | |
| m | 191808 | |
| r | 100390 | 4.9% |
| s | 99864 | 4.8% |
| u | 99864 | 4.8% |
| Other values (4) | 4382 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 183794 | |
| M | 14776 | 6.5% |
| S | 13374 | 5.9% |
| A | 13374 | 5.9% |
| I | 526 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 295632 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2802 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2290096 | |
| Common | 298434 | 11.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 393464 | |
| n | 300012 | |
| o | 280224 | |
| i | 201656 | |
| t | 196820 | |
| v | 195768 | |
| m | 191808 | |
| N | 183794 | |
| r | 100390 | 4.4% |
| s | 99864 | 4.4% |
| Other values (9) | 146296 | 6.4% |
Common
| Value | Count | Frequency (%) |
| 295632 | ||
| - | 2802 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2588530 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 393464 | |
| n | 300012 | |
| 295632 | ||
| o | 280224 | |
| i | 201656 | |
| t | 196820 | |
| v | 195768 | |
| m | 191808 | |
| N | 183794 | |
| r | 100390 | 3.9% |
| Other values (11) | 248962 |
migration_code_change_in_reg
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| Same area | |
| Different area | 5953 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 12.190974 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Same area |
| 3rd row | Not in universe |
| 4th row | Same area |
| 5th row | Same area |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 99434 | |
| Same area | 90907 | |
| Different area | 5953 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 99434 | |
| in | 99434 | |
| universe | 99434 | |
| area | 96860 | |
| same | 90907 | |
| different | 5953 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 398541 | |
| 295728 | ||
| a | 284627 | |
| i | 204821 | |
| n | 204821 | |
| r | 202247 | |
| t | 105387 | 4.4% |
| N | 99434 | 4.2% |
| o | 99434 | 4.2% |
| u | 99434 | 4.2% |
| Other values (6) | 398541 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1900993 | |
| Space Separator | 295728 | 12.4% |
| Uppercase Letter | 196294 | 8.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 398541 | |
| a | 284627 | |
| i | 204821 | |
| n | 204821 | |
| r | 202247 | |
| t | 105387 | 5.5% |
| o | 99434 | 5.2% |
| u | 99434 | 5.2% |
| v | 99434 | 5.2% |
| s | 99434 | 5.2% |
| Other values (2) | 102813 | 5.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 99434 | |
| S | 90907 | |
| D | 5953 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| 295728 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2097287 | |
| Common | 295728 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 398541 | |
| a | 284627 | |
| i | 204821 | |
| n | 204821 | |
| r | 202247 | |
| t | 105387 | 5.0% |
| N | 99434 | 4.7% |
| o | 99434 | 4.7% |
| u | 99434 | 4.7% |
| v | 99434 | 4.7% |
| Other values (5) | 299107 |
Common
| Value | Count | Frequency (%) |
| 295728 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2393015 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 398541 | |
| 295728 | ||
| a | 284627 | |
| i | 204821 | |
| n | 204821 | |
| r | 202247 | |
| t | 105387 | 4.4% |
| N | 99434 | 4.2% |
| o | 99434 | 4.2% |
| u | 99434 | 4.2% |
| Other values (6) | 398541 |
migration_code_move_within_reg
Categorical
High correlation  Imbalance 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| ? | |
|---|---|
| Nonmover | |
| Same county | 9779 |
| Different county same state | 2792 |
| Not in universe | 1419 |
| Other values (5) | 3161 |
Length
| Max length | 29 |
|---|---|
| Median length | 28 |
| Mean length | 6.1949881 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ? |
|---|---|
| 2nd row | Same county |
| 3rd row | ? |
| 4th row | Nonmover |
| 5th row | Nonmover |
Common Values
| Value | Count | Frequency (%) |
| ? | 98015 | |
| Nonmover | 81128 | |
| Same county | 9779 | 5.0% |
| Different county same state | 2792 | 1.4% |
| Not in universe | 1419 | 0.7% |
| Different state in South | 972 | 0.5% |
| Different state in West | 678 | 0.3% |
| Different state in Midwest | 551 | 0.3% |
| Abroad | 530 | 0.3% |
| Different state in Northeast | 430 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 98015 | ||
| nonmover | 81128 | |
| same | 12571 | 5.6% |
| county | 12571 | 5.6% |
| different | 5423 | 2.4% |
| state | 5423 | 2.4% |
| in | 4050 | 1.8% |
| not | 1419 | 0.6% |
| universe | 1419 | 0.6% |
| south | 972 | 0.4% |
| Other values (4) | 2189 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 225180 | ||
| o | 178178 | |
| e | 114465 | |
| n | 104591 | |
| ? | 98015 | |
| m | 93699 | |
| r | 88930 | 7.3% |
| N | 82977 | 6.8% |
| v | 82547 | 6.8% |
| t | 33320 | 2.7% |
| Other values (16) | 114137 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 791934 | |
| Space Separator | 225180 | 18.5% |
| Uppercase Letter | 100910 | 8.3% |
| Other Punctuation | 98015 | 8.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 178178 | |
| e | 114465 | |
| n | 104591 | |
| m | 93699 | |
| r | 88930 | |
| v | 82547 | |
| t | 33320 | 4.2% |
| a | 18954 | 2.4% |
| u | 14962 | 1.9% |
| c | 12571 | 1.6% |
| Other values (8) | 49717 | 6.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 82977 | |
| S | 10751 | 10.7% |
| D | 5423 | 5.4% |
| W | 678 | 0.7% |
| M | 551 | 0.5% |
| A | 530 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 225180 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 98015 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 892844 | |
| Common | 323195 | 26.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 178178 | |
| e | 114465 | |
| n | 104591 | |
| m | 93699 | |
| r | 88930 | |
| N | 82977 | |
| v | 82547 | |
| t | 33320 | 3.7% |
| a | 18954 | 2.1% |
| u | 14962 | 1.7% |
| Other values (14) | 80221 |
Common
| Value | Count | Frequency (%) |
| 225180 | ||
| ? | 98015 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1216039 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 225180 | ||
| o | 178178 | |
| e | 114465 | |
| n | 104591 | |
| ? | 98015 | |
| m | 93699 | |
| r | 88930 | 7.3% |
| N | 82977 | 6.8% |
| v | 82547 | 6.8% |
| t | 33320 | 2.7% |
| Other values (16) | 114137 |
live_in_this_house_1_year_ago
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| Yes | |
| No |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 9.4919763 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | No |
| 3rd row | Not in universe |
| 4th row | Yes |
| 5th row | Yes |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 99434 | |
| Yes | 81128 | |
| No | 15732 | 8.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 99434 | |
| in | 99434 | |
| universe | 99434 | |
| yes | 81128 | |
| no | 15732 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 295728 | ||
| e | 279996 | |
| i | 198868 | |
| n | 198868 | |
| s | 180562 | |
| N | 115166 | 6.2% |
| o | 115166 | 6.2% |
| t | 99434 | 5.3% |
| u | 99434 | 5.3% |
| v | 99434 | 5.3% |
| Other values (2) | 180562 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1371196 | |
| Space Separator | 295728 | 15.9% |
| Uppercase Letter | 196294 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 279996 | |
| i | 198868 | |
| n | 198868 | |
| s | 180562 | |
| o | 115166 | |
| t | 99434 | 7.3% |
| u | 99434 | 7.3% |
| v | 99434 | 7.3% |
| r | 99434 | 7.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 115166 | |
| Y | 81128 |
Space Separator
| Value | Count | Frequency (%) |
| 295728 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1567490 | |
| Common | 295728 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 279996 | |
| i | 198868 | |
| n | 198868 | |
| s | 180562 | |
| N | 115166 | |
| o | 115166 | |
| t | 99434 | 6.3% |
| u | 99434 | 6.3% |
| v | 99434 | 6.3% |
| r | 99434 | 6.3% |
Common
| Value | Count | Frequency (%) |
| 295728 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1863218 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 295728 | ||
| e | 279996 | |
| i | 198868 | |
| n | 198868 | |
| s | 180562 | |
| N | 115166 | 6.2% |
| o | 115166 | 6.2% |
| t | 99434 | 5.3% |
| u | 99434 | 5.3% |
| v | 99434 | 5.3% |
| Other values (2) | 180562 |
migration_prev_res_in_sunbelt
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| No | 9959 |
| Yes | 5773 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.067669 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Yes |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 180562 | |
| No | 9959 | 5.1% |
| Yes | 5773 | 2.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 180562 | |
| in | 180562 | |
| universe | 180562 | |
| no | 9959 | 1.8% |
| yes | 5773 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 376856 | ||
| e | 366897 | |
| i | 361124 | |
| n | 361124 | |
| N | 190521 | |
| o | 190521 | |
| s | 186335 | |
| t | 180562 | |
| u | 180562 | |
| v | 180562 | |
| Other values (2) | 186335 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2188249 | |
| Space Separator | 376856 | 13.6% |
| Uppercase Letter | 196294 | 7.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 366897 | |
| i | 361124 | |
| n | 361124 | |
| o | 190521 | |
| s | 186335 | |
| t | 180562 | |
| u | 180562 | |
| v | 180562 | |
| r | 180562 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 190521 | |
| Y | 5773 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 376856 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2384543 | |
| Common | 376856 | 13.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 366897 | |
| i | 361124 | |
| n | 361124 | |
| N | 190521 | |
| o | 190521 | |
| s | 186335 | |
| t | 180562 | |
| u | 180562 | |
| v | 180562 | |
| r | 180562 |
Common
| Value | Count | Frequency (%) |
| 376856 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2761399 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 376856 | ||
| e | 366897 | |
| i | 361124 | |
| n | 361124 | |
| N | 190521 | |
| o | 190521 | |
| s | 186335 | |
| t | 180562 | |
| u | 180562 | |
| v | 180562 | |
| Other values (2) | 186335 |
num_persons_worked_for_employer
Real number (ℝ)
High correlation  Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9881046 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 92770 |
| Zeros (%) | 47.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.3710177 |
|---|---|
| Coefficient of variation (CV) | 1.1926021 |
| Kurtosis | -1.1180198 |
| Mean | 1.9881046 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.72692272 |
| Sum | 390253 |
| Variance | 5.6217248 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 92770 | |
| 6 | 36507 | 18.6% |
| 1 | 23103 | 11.8% |
| 4 | 14377 | 7.3% |
| 3 | 13424 | 6.8% |
| 2 | 10079 | 5.1% |
| 5 | 6034 | 3.1% |
| Value | Count | Frequency (%) |
| 0 | 92770 | |
| 1 | 23103 | 11.8% |
| 2 | 10079 | 5.1% |
| 3 | 13424 | 6.8% |
| 4 | 14377 | 7.3% |
| 5 | 6034 | 3.1% |
| 6 | 36507 | 18.6% |
| Value | Count | Frequency (%) |
| 6 | 36507 | 18.6% |
| 5 | 6034 | 3.1% |
| 4 | 14377 | 7.3% |
| 3 | 13424 | 6.8% |
| 2 | 10079 | 5.1% |
| 1 | 23103 | 11.8% |
| 0 | 92770 |
family_members_under_18
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| Both parents present | |
| Mother only present | 12517 |
| Father only present | 1871 |
| Neither parent present | 1638 |
Length
| Max length | 23 |
|---|---|
| Median length | 16 |
| Mean length | 17.271323 |
| Min length | 16 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Both parents present |
| 5th row | Both parents present |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 144161 | |
| Both parents present | 36107 | 18.4% |
| Mother only present | 12517 | 6.4% |
| Father only present | 1871 | 1.0% |
| Neither parent present | 1638 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 144161 | |
| in | 144161 | |
| universe | 144161 | |
| present | 52133 | 8.9% |
| both | 36107 | 6.1% |
| parents | 36107 | 6.1% |
| only | 14388 | 2.4% |
| mother | 12517 | 2.1% |
| father | 1871 | 0.3% |
| neither | 1638 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 588882 | ||
| e | 447997 | |
| n | 392588 | |
| i | 289960 | |
| t | 286172 | |
| r | 250065 | |
| s | 232401 | 6.9% |
| o | 207173 | 6.1% |
| N | 145799 | 4.3% |
| u | 144161 | 4.3% |
| Other values (9) | 405059 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2605081 | |
| Space Separator | 588882 | 17.4% |
| Uppercase Letter | 196294 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 447997 | |
| n | 392588 | |
| i | 289960 | |
| t | 286172 | |
| r | 250065 | |
| s | 232401 | |
| o | 207173 | |
| u | 144161 | 5.5% |
| v | 144161 | 5.5% |
| p | 89878 | 3.5% |
| Other values (4) | 120525 | 4.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 145799 | |
| B | 36107 | 18.4% |
| M | 12517 | 6.4% |
| F | 1871 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 588882 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2801375 | |
| Common | 588882 | 17.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 447997 | |
| n | 392588 | |
| i | 289960 | |
| t | 286172 | |
| r | 250065 | |
| s | 232401 | |
| o | 207173 | |
| N | 145799 | 5.2% |
| u | 144161 | 5.1% |
| v | 144161 | 5.1% |
| Other values (8) | 260898 |
Common
| Value | Count | Frequency (%) |
| 588882 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3390257 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 588882 | ||
| e | 447997 | |
| n | 392588 | |
| i | 289960 | |
| t | 286172 | |
| r | 250065 | |
| s | 232401 | 6.9% |
| o | 207173 | 6.1% |
| N | 145799 | 4.3% |
| u | 144161 | 4.3% |
| Other values (9) | 405059 |
country_of_birth_father
Categorical
High correlation  Imbalance 
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| United-States | |
|---|---|
| Mexico | 9948 |
| ? | 6703 |
| Puerto-Rico | 2676 |
| Italy | 2212 |
| Other values (38) |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 12.650193 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | Vietnam |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 156037 | |
| Mexico | 9948 | 5.1% |
| ? | 6703 | 3.4% |
| Puerto-Rico | 2676 | 1.4% |
| Italy | 2212 | 1.1% |
| Canada | 1380 | 0.7% |
| Germany | 1356 | 0.7% |
| Dominican-Republic | 1284 | 0.7% |
| Poland | 1210 | 0.6% |
| Philippines | 1152 | 0.6% |
| Other values (33) | 12336 | 6.3% |
Length
| Value | Count | Frequency (%) |
| united-states | 156037 | |
| mexico | 9948 | 5.0% |
| 6703 | 3.4% | |
| puerto-rico | 2676 | 1.4% |
| italy | 2212 | 1.1% |
| canada | 1380 | 0.7% |
| germany | 1356 | 0.7% |
| dominican-republic | 1284 | 0.6% |
| poland | 1210 | 0.6% |
| philippines | 1152 | 0.6% |
| Other values (39) | 13608 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 475783 | |
| e | 332247 | |
| 197566 | ||
| a | 182642 | 7.4% |
| i | 180939 | 7.3% |
| n | 170161 | 6.9% |
| d | 162934 | 6.6% |
| - | 161189 | 6.5% |
| S | 158114 | 6.4% |
| s | 157805 | 6.4% |
| Other values (37) | 303777 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1764787 | |
| Uppercase Letter | 352482 | 14.2% |
| Space Separator | 197566 | 8.0% |
| Dash Punctuation | 161189 | 6.5% |
| Other Punctuation | 6815 | 0.3% |
| Open Punctuation | 159 | < 0.1% |
| Close Punctuation | 159 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 475783 | |
| e | 332247 | |
| a | 182642 | 10.3% |
| i | 180939 | 10.3% |
| n | 170161 | 9.6% |
| d | 162934 | 9.2% |
| s | 157805 | 8.9% |
| o | 22709 | 1.3% |
| c | 17286 | 1.0% |
| l | 11397 | 0.6% |
| Other values (11) | 50884 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 158114 | |
| U | 156355 | |
| M | 9948 | 2.8% |
| P | 5785 | 1.6% |
| C | 4164 | 1.2% |
| R | 3960 | 1.1% |
| I | 3691 | 1.0% |
| G | 2302 | 0.7% |
| E | 2151 | 0.6% |
| D | 1284 | 0.4% |
| Other values (10) | 4728 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 6703 | |
| & | 112 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 197566 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 161189 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 159 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 159 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2117269 | |
| Common | 365888 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 475783 | |
| e | 332247 | |
| a | 182642 | 8.6% |
| i | 180939 | 8.5% |
| n | 170161 | 8.0% |
| d | 162934 | 7.7% |
| S | 158114 | 7.5% |
| s | 157805 | 7.5% |
| U | 156355 | 7.4% |
| o | 22709 | 1.1% |
| Other values (31) | 117580 | 5.6% |
Common
| Value | Count | Frequency (%) |
| 197566 | ||
| - | 161189 | |
| ? | 6703 | 1.8% |
| ( | 159 | < 0.1% |
| ) | 159 | < 0.1% |
| & | 112 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2483157 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 475783 | |
| e | 332247 | |
| 197566 | ||
| a | 182642 | 7.4% |
| i | 180939 | 7.3% |
| n | 170161 | 6.9% |
| d | 162934 | 6.6% |
| - | 161189 | 6.5% |
| S | 158114 | 6.4% |
| s | 157805 | 6.4% |
| Other values (37) | 303777 |
country_of_birth_mother
Categorical
High correlation  Imbalance 
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| United-States | |
|---|---|
| Mexico | 9721 |
| ? | 6107 |
| Puerto-Rico | 2468 |
| Italy | 1844 |
| Other values (38) |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 12.703582 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | Vietnam |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 157355 | |
| Mexico | 9721 | 5.0% |
| ? | 6107 | 3.1% |
| Puerto-Rico | 2468 | 1.3% |
| Italy | 1844 | 0.9% |
| Canada | 1451 | 0.7% |
| Germany | 1382 | 0.7% |
| Philippines | 1228 | 0.6% |
| Poland | 1109 | 0.6% |
| El-Salvador | 1107 | 0.6% |
| Other values (33) | 12522 | 6.4% |
Length
| Value | Count | Frequency (%) |
| united-states | 157355 | |
| mexico | 9721 | 4.9% |
| 6107 | 3.1% | |
| puerto-rico | 2468 | 1.2% |
| italy | 1844 | 0.9% |
| canada | 1451 | 0.7% |
| germany | 1382 | 0.7% |
| philippines | 1228 | 0.6% |
| poland | 1109 | 0.6% |
| el-salvador | 1107 | 0.6% |
| Other values (39) | 13864 | 7.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 479197 | |
| e | 334332 | |
| 197636 | ||
| a | 183901 | 7.4% |
| i | 181335 | 7.3% |
| n | 171510 | 6.9% |
| d | 164509 | 6.6% |
| - | 162233 | 6.5% |
| S | 159624 | 6.4% |
| s | 159182 | 6.4% |
| Other values (37) | 300178 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1773075 | |
| Uppercase Letter | 354174 | 14.2% |
| Space Separator | 197636 | 7.9% |
| Dash Punctuation | 162233 | 6.5% |
| Other Punctuation | 6205 | 0.2% |
| Open Punctuation | 157 | < 0.1% |
| Close Punctuation | 157 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 479197 | |
| e | 334332 | |
| a | 183901 | 10.4% |
| i | 181335 | 10.2% |
| n | 171510 | 9.7% |
| d | 164509 | 9.3% |
| s | 159182 | 9.0% |
| o | 21918 | 1.2% |
| c | 16382 | 0.9% |
| l | 11183 | 0.6% |
| Other values (11) | 49626 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 159624 | |
| U | 157669 | |
| M | 9721 | 2.7% |
| P | 5533 | 1.6% |
| C | 4082 | 1.2% |
| R | 3565 | 1.0% |
| I | 3378 | 1.0% |
| E | 2383 | 0.7% |
| G | 2242 | 0.6% |
| D | 1097 | 0.3% |
| Other values (10) | 4880 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 6107 | |
| & | 98 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 197636 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 162233 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 157 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 157 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2127249 | |
| Common | 366388 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 479197 | |
| e | 334332 | |
| a | 183901 | 8.6% |
| i | 181335 | 8.5% |
| n | 171510 | 8.1% |
| d | 164509 | 7.7% |
| S | 159624 | 7.5% |
| s | 159182 | 7.5% |
| U | 157669 | 7.4% |
| o | 21918 | 1.0% |
| Other values (31) | 114072 | 5.4% |
Common
| Value | Count | Frequency (%) |
| 197636 | ||
| - | 162233 | |
| ? | 6107 | 1.7% |
| ( | 157 | < 0.1% |
| ) | 157 | < 0.1% |
| & | 98 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2493637 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 479197 | |
| e | 334332 | |
| 197636 | ||
| a | 183901 | 7.4% |
| i | 181335 | 7.3% |
| n | 171510 | 6.9% |
| d | 164509 | 6.6% |
| - | 162233 | 6.5% |
| S | 159624 | 6.4% |
| s | 159182 | 6.4% |
| Other values (37) | 300178 |
country_of_birth_self
Categorical
High correlation  Imbalance 
| Distinct | 43 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| United-States | |
|---|---|
| Mexico | 5759 |
| ? | 3389 |
| Puerto-Rico | 1400 |
| Germany | 850 |
| Other values (38) | 11113 |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 13.268597 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United-States |
|---|---|
| 2nd row | United-States |
| 3rd row | Vietnam |
| 4th row | United-States |
| 5th row | United-States |
Common Values
| Value | Count | Frequency (%) |
| United-States | 173783 | |
| Mexico | 5759 | 2.9% |
| ? | 3389 | 1.7% |
| Puerto-Rico | 1400 | 0.7% |
| Germany | 850 | 0.4% |
| Philippines | 844 | 0.4% |
| Cuba | 836 | 0.4% |
| Canada | 700 | 0.4% |
| El-Salvador | 689 | 0.4% |
| Dominican-Republic | 687 | 0.3% |
| Other values (33) | 7357 | 3.7% |
Length
| Value | Count | Frequency (%) |
| united-states | 173783 | |
| mexico | 5759 | 2.9% |
| 3389 | 1.7% | |
| puerto-rico | 1400 | 0.7% |
| germany | 850 | 0.4% |
| philippines | 844 | 0.4% |
| cuba | 836 | 0.4% |
| canada | 700 | 0.4% |
| el-salvador | 689 | 0.3% |
| dominican-republic | 687 | 0.3% |
| Other values (39) | 8404 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 525111 | |
| e | 359441 | |
| 197341 | 7.6% | |
| a | 189262 | 7.3% |
| i | 188898 | 7.3% |
| n | 181941 | 7.0% |
| d | 177412 | 6.8% |
| - | 176701 | 6.8% |
| S | 175256 | 6.7% |
| s | 174965 | 6.7% |
| Other values (37) | 258218 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1855854 | |
| Uppercase Letter | 370957 | 14.2% |
| Space Separator | 197341 | 7.6% |
| Dash Punctuation | 176701 | 6.8% |
| Other Punctuation | 3455 | 0.1% |
| Open Punctuation | 119 | < 0.1% |
| Close Punctuation | 119 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 525111 | |
| e | 359441 | |
| a | 189262 | 10.2% |
| i | 188898 | 10.2% |
| n | 181941 | 9.8% |
| d | 177412 | 9.6% |
| s | 174965 | 9.4% |
| o | 12963 | 0.7% |
| c | 9791 | 0.5% |
| x | 5759 | 0.3% |
| Other values (11) | 30311 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 175256 | |
| U | 174021 | |
| M | 5759 | 1.6% |
| P | 3095 | 0.8% |
| C | 2542 | 0.7% |
| R | 2087 | 0.6% |
| G | 1459 | 0.4% |
| E | 1402 | 0.4% |
| I | 1237 | 0.3% |
| D | 687 | 0.2% |
| Other values (10) | 3412 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 3389 | |
| & | 66 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 197341 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 176701 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 119 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 119 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2226811 | |
| Common | 377735 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 525111 | |
| e | 359441 | |
| a | 189262 | 8.5% |
| i | 188898 | 8.5% |
| n | 181941 | 8.2% |
| d | 177412 | 8.0% |
| S | 175256 | 7.9% |
| s | 174965 | 7.9% |
| U | 174021 | 7.8% |
| o | 12963 | 0.6% |
| Other values (31) | 67541 | 3.0% |
Common
| Value | Count | Frequency (%) |
| 197341 | ||
| - | 176701 | |
| ? | 3389 | 0.9% |
| ( | 119 | < 0.1% |
| ) | 119 | < 0.1% |
| & | 66 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2604546 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 525111 | |
| e | 359441 | |
| 197341 | 7.6% | |
| a | 189262 | 7.3% |
| i | 188898 | 7.3% |
| n | 181941 | 7.0% |
| d | 177412 | 6.8% |
| - | 176701 | 6.8% |
| S | 175256 | 6.7% |
| s | 174965 | 6.7% |
| Other values (37) | 258218 |
citizenship
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Native | |
|---|---|
| Foreign | 13385 |
| Naturalized | 5851 |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.2172252 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Native |
|---|---|
| 2nd row | Native |
| 3rd row | Foreign |
| 4th row | Native |
| 5th row | Native |
Common Values
| Value | Count | Frequency (%) |
| Native | 177058 | |
| Foreign | 13385 | 6.8% |
| Naturalized | 5851 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| native | 177058 | |
| foreign | 13385 | 6.8% |
| naturalized | 5851 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 196294 | |
| e | 196294 | |
| a | 188760 | |
| N | 182909 | |
| t | 182909 | |
| v | 177058 | |
| r | 19236 | 1.6% |
| F | 13385 | 1.1% |
| o | 13385 | 1.1% |
| g | 13385 | 1.1% |
| Other values (5) | 36789 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1024110 | |
| Uppercase Letter | 196294 | 16.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 196294 | |
| e | 196294 | |
| a | 188760 | |
| t | 182909 | |
| v | 177058 | |
| r | 19236 | 1.9% |
| o | 13385 | 1.3% |
| g | 13385 | 1.3% |
| n | 13385 | 1.3% |
| u | 5851 | 0.6% |
| Other values (3) | 17553 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 182909 | |
| F | 13385 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1220404 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 196294 | |
| e | 196294 | |
| a | 188760 | |
| N | 182909 | |
| t | 182909 | |
| v | 177058 | |
| r | 19236 | 1.6% |
| F | 13385 | 1.1% |
| o | 13385 | 1.1% |
| g | 13385 | 1.1% |
| Other values (5) | 36789 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1220404 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 196294 | |
| e | 196294 | |
| a | 188760 | |
| N | 182909 | |
| t | 182909 | |
| v | 177058 | |
| r | 19236 | 1.6% |
| F | 13385 | 1.1% |
| o | 13385 | 1.1% |
| g | 13385 | 1.1% |
| Other values (5) | 36789 | 3.0% |
own_business_or_self_employed
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| No | 16151 |
| Yes | 2698 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.765428 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 177445 | |
| No | 16151 | 8.2% |
| Yes | 2698 | 1.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 177445 | |
| in | 177445 | |
| universe | 177445 | |
| no | 16151 | 2.9% |
| yes | 2698 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 357588 | |
| 354890 | ||
| i | 354890 | |
| n | 354890 | |
| N | 193596 | |
| o | 193596 | |
| s | 180143 | |
| t | 177445 | |
| u | 177445 | |
| v | 177445 | |
| Other values (2) | 180143 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2150887 | |
| Space Separator | 354890 | 13.1% |
| Uppercase Letter | 196294 | 7.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 357588 | |
| i | 354890 | |
| n | 354890 | |
| o | 193596 | |
| s | 180143 | |
| t | 177445 | |
| u | 177445 | |
| v | 177445 | |
| r | 177445 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 193596 | |
| Y | 2698 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 354890 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2347181 | |
| Common | 354890 | 13.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 357588 | |
| i | 354890 | |
| n | 354890 | |
| N | 193596 | |
| o | 193596 | |
| s | 180143 | |
| t | 177445 | |
| u | 177445 | |
| v | 177445 | |
| r | 177445 |
Common
| Value | Count | Frequency (%) |
| 354890 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2702071 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 357588 | |
| 354890 | ||
| i | 354890 | |
| n | 354890 | |
| N | 193596 | |
| o | 193596 | |
| s | 180143 | |
| t | 177445 | |
| u | 177445 | |
| v | 177445 | |
| Other values (2) | 180143 |
fill_inc_questionnaire_for_veteran's_admin
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not in universe | |
|---|---|
| No | 1593 |
| Yes | 391 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.870597 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 194310 | |
| No | 1593 | 0.8% |
| Yes | 391 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 194310 | |
| in | 194310 | |
| universe | 194310 | |
| no | 1593 | 0.3% |
| yes | 391 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 584914 | ||
| e | 389011 | |
| i | 388620 | |
| n | 388620 | |
| N | 195903 | 6.3% |
| o | 195903 | 6.3% |
| s | 194701 | 6.2% |
| t | 194310 | 6.2% |
| u | 194310 | 6.2% |
| v | 194310 | 6.2% |
| Other values (2) | 194701 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2334095 | |
| Space Separator | 584914 | 18.8% |
| Uppercase Letter | 196294 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 389011 | |
| i | 388620 | |
| n | 388620 | |
| o | 195903 | |
| s | 194701 | |
| t | 194310 | |
| u | 194310 | |
| v | 194310 | |
| r | 194310 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 195903 | |
| Y | 391 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 584914 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2530389 | |
| Common | 584914 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 389011 | |
| i | 388620 | |
| n | 388620 | |
| N | 195903 | |
| o | 195903 | |
| s | 194701 | |
| t | 194310 | |
| u | 194310 | |
| v | 194310 | |
| r | 194310 |
Common
| Value | Count | Frequency (%) |
| 584914 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3115303 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 584914 | ||
| e | 389011 | |
| i | 388620 | |
| n | 388620 | |
| N | 195903 | 6.3% |
| o | 195903 | 6.3% |
| s | 194701 | 6.2% |
| t | 194310 | 6.2% |
| u | 194310 | 6.2% |
| v | 194310 | 6.2% |
| Other values (2) | 194701 | 6.2% |
veterans_benefits
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| Not a Veteran | |
|---|---|
| Not in universe | |
| Veteran | 1984 |
Length
| Max length | 15 |
|---|---|
| Median length | 13 |
| Mean length | 13.391066 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not a Veteran |
|---|---|
| 2nd row | Not a Veteran |
| 3rd row | Not a Veteran |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not a Veteran | 149976 | |
| Not in universe | 44334 | 22.6% |
| Veteran | 1984 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 194310 | |
| veteran | 151960 | |
| a | 149976 | |
| in | 44334 | 7.6% |
| universe | 44334 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 392588 | |
| 388620 | ||
| t | 346270 | |
| a | 301936 | |
| n | 240628 | |
| r | 196294 | |
| N | 194310 | |
| o | 194310 | |
| V | 151960 | 5.8% |
| i | 88668 | 3.4% |
| Other values (3) | 133002 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1893696 | |
| Space Separator | 388620 | 14.8% |
| Uppercase Letter | 346270 | 13.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 392588 | |
| t | 346270 | |
| a | 301936 | |
| n | 240628 | |
| r | 196294 | |
| o | 194310 | |
| i | 88668 | 4.7% |
| u | 44334 | 2.3% |
| v | 44334 | 2.3% |
| s | 44334 | 2.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 194310 | |
| V | 151960 |
Space Separator
| Value | Count | Frequency (%) |
| 388620 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2239966 | |
| Common | 388620 | 14.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 392588 | |
| t | 346270 | |
| a | 301936 | |
| n | 240628 | |
| r | 196294 | |
| N | 194310 | |
| o | 194310 | |
| V | 151960 | 6.8% |
| i | 88668 | 4.0% |
| u | 44334 | 2.0% |
| Other values (2) | 88668 | 4.0% |
Common
| Value | Count | Frequency (%) |
| 388620 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2628586 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 392588 | |
| 388620 | ||
| t | 346270 | |
| a | 301936 | |
| n | 240628 | |
| r | 196294 | |
| N | 194310 | |
| o | 194310 | |
| V | 151960 | 5.8% |
| i | 88668 | 3.4% |
| Other values (3) | 133002 | 5.1% |
weeks_worked_in_year
Real number (ℝ)
High correlation  Zeros 
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.553889 |
| Minimum | 0 |
|---|---|
| Maximum | 52 |
| Zeros | 92770 |
| Zeros (%) | 47.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 12 |
| Q3 | 52 |
| 95-th percentile | 52 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 24.428588 |
|---|---|
| Coefficient of variation (CV) | 1.0371361 |
| Kurtosis | -1.8743188 |
| Mean | 23.553889 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.18056487 |
| Sum | 4623487 |
| Variance | 596.75593 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 92770 | |
| 52 | 70308 | |
| 40 | 2790 | 1.4% |
| 50 | 2304 | 1.2% |
| 26 | 2268 | 1.2% |
| 48 | 1806 | 0.9% |
| 12 | 1777 | 0.9% |
| 30 | 1378 | 0.7% |
| 20 | 1330 | 0.7% |
| 8 | 1125 | 0.6% |
| Other values (43) | 18438 | 9.4% |
| Value | Count | Frequency (%) |
| 0 | 92770 | |
| 1 | 464 | 0.2% |
| 2 | 457 | 0.2% |
| 3 | 417 | 0.2% |
| 4 | 757 | 0.4% |
| 5 | 309 | 0.2% |
| 6 | 645 | 0.3% |
| 7 | 152 | 0.1% |
| 8 | 1125 | 0.6% |
| 9 | 239 | 0.1% |
| Value | Count | Frequency (%) |
| 52 | 70308 | |
| 51 | 819 | 0.4% |
| 50 | 2304 | 1.2% |
| 49 | 509 | 0.3% |
| 48 | 1806 | 0.9% |
| 47 | 278 | 0.1% |
| 46 | 708 | 0.4% |
| 45 | 669 | 0.3% |
| 44 | 845 | 0.4% |
| 43 | 374 | 0.2% |
year
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| 1994 | |
|---|---|
| 1995 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1995 |
|---|---|
| 2nd row | 1994 |
| 3rd row | 1995 |
| 4th row | 1994 |
| 5th row | 1994 |
Common Values
| Value | Count | Frequency (%) |
| 1994 | 98279 | |
| 1995 | 98015 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1994 | 98279 | |
| 1995 | 98015 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 392588 | |
| 1 | 196294 | |
| 4 | 98279 | 12.5% |
| 5 | 98015 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 785176 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 392588 | |
| 1 | 196294 | |
| 4 | 98279 | 12.5% |
| 5 | 98015 | 12.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 785176 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 392588 | |
| 1 | 196294 | |
| 4 | 98279 | 12.5% |
| 5 | 98015 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 785176 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 392588 | |
| 1 | 196294 | |
| 4 | 98279 | 12.5% |
| 5 | 98015 | 12.5% |
target
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| 1 | |
|---|---|
| 0 | 12382 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 183912 | |
| 0 | 12382 | 6.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 183912 | |
| 0 | 12382 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 183912 | |
| 0 | 12382 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 196294 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 183912 | |
| 0 | 12382 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 196294 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 183912 | |
| 0 | 12382 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 196294 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 183912 | |
| 0 | 12382 | 6.3% |
Interactions
Correlations
| age | capital_gains | capital_losses | citizenship | class_of_worker | country_of_birth_father | country_of_birth_mother | country_of_birth_self | detailed_household_and_family_stat | detailed_household_summary_in_household | detailed_industry_recode | detailed_occupation_recode | dividends_from_stocks | education | enroll_in_edu_inst_last_wk | family_members_under_18 | fill_inc_questionnaire_for_veteran's_admin | full_or_part_time_employment_stat | hispanic_origin | instance_weight | live_in_this_house_1_year_ago | major_industry_code | major_occupation_code | marital_stat | member_of_a_labor_union | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | own_business_or_self_employed | race | reason_for_unemployment | region_of_previous_residence | sex | target | tax_filer_stat | veterans_benefits | wage_per_hour | weeks_worked_in_year | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | 0.127 | 0.068 | 0.121 | 0.365 | 0.092 | 0.086 | 0.062 | 0.507 | 0.397 | 0.245 | 0.249 | 0.251 | 0.444 | 0.434 | 0.481 | 0.090 | 0.315 | 0.051 | 0.004 | 0.115 | 0.244 | 0.243 | 0.425 | 0.172 | 0.073 | 0.066 | 0.086 | 0.108 | 0.226 | 0.192 | 0.056 | 0.080 | 0.069 | 0.061 | 0.242 | 0.585 | 0.644 | 0.039 | 0.269 | 0.009 |
| capital_gains | 0.127 | 1.000 | -0.028 | 0.012 | 0.048 | 0.009 | 0.009 | 0.004 | 0.042 | 0.051 | 0.049 | 0.097 | 0.117 | 0.075 | 0.022 | 0.029 | 0.000 | 0.030 | 0.012 | 0.003 | 0.008 | 0.047 | 0.064 | 0.032 | 0.019 | 0.014 | 0.006 | 0.014 | 0.008 | 0.113 | 0.027 | 0.011 | 0.004 | 0.014 | 0.058 | 0.318 | 0.059 | 0.037 | 0.005 | 0.130 | 0.006 |
| capital_losses | 0.068 | -0.028 | 1.000 | 0.007 | 0.051 | 0.010 | 0.006 | 0.000 | 0.047 | 0.054 | 0.038 | 0.047 | 0.067 | 0.054 | 0.021 | 0.039 | 0.010 | 0.034 | 0.006 | 0.009 | 0.004 | 0.037 | 0.040 | 0.045 | 0.020 | 0.002 | 0.002 | 0.002 | 0.004 | 0.097 | 0.023 | 0.010 | 0.004 | 0.000 | 0.073 | 0.172 | 0.095 | 0.054 | 0.005 | 0.105 | 0.003 |
| citizenship | 0.121 | 0.012 | 0.007 | 1.000 | 0.057 | 0.536 | 0.542 | 0.720 | 0.107 | 0.111 | 0.098 | 0.114 | 0.009 | 0.137 | 0.018 | 0.086 | 0.016 | 0.046 | 0.397 | 0.044 | 0.031 | 0.083 | 0.093 | 0.102 | 0.013 | 0.092 | 0.021 | 0.092 | 0.029 | 0.046 | 0.024 | 0.240 | 0.027 | 0.092 | 0.010 | 0.038 | 0.055 | 0.084 | 0.017 | 0.035 | 0.012 |
| class_of_worker | 0.365 | 0.048 | 0.051 | 0.057 | 1.000 | 0.055 | 0.053 | 0.052 | 0.261 | 0.276 | 0.639 | 0.599 | 0.015 | 0.318 | 0.090 | 0.275 | 0.026 | 0.363 | 0.047 | 0.018 | 0.037 | 0.655 | 0.549 | 0.212 | 0.278 | 0.029 | 0.021 | 0.052 | 0.036 | 0.510 | 0.208 | 0.047 | 0.436 | 0.027 | 0.123 | 0.235 | 0.486 | 0.389 | 0.083 | 0.448 | 0.003 |
| country_of_birth_father | 0.092 | 0.009 | 0.010 | 0.536 | 0.055 | 1.000 | 0.771 | 0.659 | 0.095 | 0.067 | 0.029 | 0.036 | 0.005 | 0.114 | 0.036 | 0.069 | 0.024 | 0.062 | 0.532 | 0.044 | 0.037 | 0.034 | 0.049 | 0.086 | 0.043 | 0.053 | 0.029 | 0.040 | 0.047 | 0.043 | 0.049 | 0.440 | 0.019 | 0.062 | 0.025 | 0.071 | 0.077 | 0.080 | 0.005 | 0.030 | 0.030 |
| country_of_birth_mother | 0.086 | 0.009 | 0.006 | 0.542 | 0.053 | 0.771 | 1.000 | 0.686 | 0.091 | 0.064 | 0.029 | 0.036 | 0.000 | 0.112 | 0.036 | 0.066 | 0.024 | 0.059 | 0.542 | 0.043 | 0.036 | 0.034 | 0.048 | 0.083 | 0.042 | 0.053 | 0.029 | 0.040 | 0.048 | 0.041 | 0.046 | 0.443 | 0.018 | 0.062 | 0.024 | 0.070 | 0.072 | 0.076 | 0.007 | 0.028 | 0.030 |
| country_of_birth_self | 0.062 | 0.004 | 0.000 | 0.720 | 0.052 | 0.659 | 0.686 | 1.000 | 0.088 | 0.063 | 0.035 | 0.042 | 0.000 | 0.123 | 0.029 | 0.062 | 0.015 | 0.045 | 0.489 | 0.032 | 0.033 | 0.039 | 0.055 | 0.070 | 0.028 | 0.059 | 0.028 | 0.045 | 0.042 | 0.036 | 0.025 | 0.380 | 0.024 | 0.066 | 0.028 | 0.059 | 0.058 | 0.084 | 0.007 | 0.021 | 0.024 |
| detailed_household_and_family_stat | 0.507 | 0.042 | 0.047 | 0.107 | 0.261 | 0.095 | 0.091 | 0.088 | 1.000 | 0.981 | 0.266 | 0.275 | 0.027 | 0.428 | 0.205 | 0.527 | 0.048 | 0.180 | 0.081 | 0.033 | 0.073 | 0.267 | 0.268 | 0.487 | 0.105 | 0.062 | 0.047 | 0.089 | 0.069 | 0.261 | 0.095 | 0.075 | 0.040 | 0.060 | 0.057 | 0.195 | 0.538 | 0.504 | 0.052 | 0.276 | 0.005 |
| detailed_household_summary_in_household | 0.397 | 0.051 | 0.054 | 0.111 | 0.276 | 0.067 | 0.064 | 0.063 | 0.981 | 1.000 | 0.220 | 0.234 | 0.018 | 0.397 | 0.336 | 0.529 | 0.067 | 0.215 | 0.055 | 0.037 | 0.073 | 0.220 | 0.223 | 0.419 | 0.128 | 0.051 | 0.048 | 0.065 | 0.068 | 0.228 | 0.136 | 0.072 | 0.063 | 0.048 | 0.372 | 0.225 | 0.662 | 0.603 | 0.036 | 0.221 | 0.006 |
| detailed_industry_recode | 0.245 | 0.049 | 0.038 | 0.098 | 0.639 | 0.029 | 0.029 | 0.035 | 0.266 | 0.220 | 1.000 | 0.425 | 0.009 | 0.321 | 0.127 | 0.276 | 0.029 | 0.360 | 0.052 | 0.018 | 0.036 | 0.915 | 0.599 | 0.195 | 0.261 | 0.031 | 0.028 | 0.036 | 0.039 | 0.404 | 0.208 | 0.057 | 0.155 | 0.030 | 0.301 | 0.280 | 0.488 | 0.387 | 0.067 | 0.305 | 0.007 |
| detailed_occupation_recode | 0.249 | 0.097 | 0.047 | 0.114 | 0.599 | 0.036 | 0.036 | 0.042 | 0.275 | 0.234 | 0.425 | 1.000 | 0.019 | 0.404 | 0.161 | 0.276 | 0.032 | 0.360 | 0.063 | 0.017 | 0.036 | 0.561 | 1.000 | 0.203 | 0.261 | 0.029 | 0.026 | 0.036 | 0.037 | 0.395 | 0.214 | 0.071 | 0.163 | 0.027 | 0.395 | 0.437 | 0.497 | 0.387 | 0.077 | 0.310 | 0.006 |
| dividends_from_stocks | 0.251 | 0.117 | 0.067 | 0.009 | 0.015 | 0.005 | 0.000 | 0.000 | 0.027 | 0.018 | 0.009 | 0.019 | 1.000 | 0.040 | 0.009 | 0.019 | 0.004 | 0.015 | 0.007 | 0.011 | 0.009 | 0.013 | 0.014 | 0.023 | 0.005 | 0.007 | 0.006 | 0.006 | 0.007 | 0.147 | 0.012 | 0.010 | 0.000 | 0.007 | 0.011 | 0.146 | 0.037 | 0.025 | -0.000 | 0.152 | 0.000 |
| education | 0.444 | 0.075 | 0.054 | 0.137 | 0.318 | 0.114 | 0.112 | 0.123 | 0.428 | 0.397 | 0.321 | 0.404 | 0.040 | 1.000 | 0.327 | 0.454 | 0.043 | 0.279 | 0.102 | 0.021 | 0.024 | 0.319 | 0.373 | 0.299 | 0.148 | 0.020 | 0.029 | 0.074 | 0.023 | 0.286 | 0.156 | 0.068 | 0.064 | 0.017 | 0.068 | 0.377 | 0.545 | 0.707 | 0.051 | 0.287 | 0.009 |
| enroll_in_edu_inst_last_wk | 0.434 | 0.022 | 0.021 | 0.018 | 0.090 | 0.036 | 0.036 | 0.029 | 0.205 | 0.336 | 0.127 | 0.161 | 0.009 | 0.327 | 1.000 | 0.157 | 0.014 | 0.073 | 0.022 | 0.022 | 0.019 | 0.133 | 0.118 | 0.199 | 0.025 | 0.021 | 0.014 | 0.027 | 0.020 | 0.071 | 0.070 | 0.026 | 0.087 | 0.021 | 0.014 | 0.065 | 0.173 | 0.102 | 0.023 | 0.184 | 0.003 |
| family_members_under_18 | 0.481 | 0.029 | 0.039 | 0.086 | 0.275 | 0.069 | 0.066 | 0.062 | 0.527 | 0.529 | 0.276 | 0.276 | 0.019 | 0.454 | 0.157 | 1.000 | 0.042 | 0.221 | 0.063 | 0.017 | 0.027 | 0.276 | 0.276 | 0.350 | 0.128 | 0.022 | 0.016 | 0.074 | 0.021 | 0.283 | 0.125 | 0.099 | 0.045 | 0.018 | 0.038 | 0.156 | 0.531 | 0.628 | 0.043 | 0.283 | 0.006 |
| fill_inc_questionnaire_for_veteran's_admin | 0.090 | 0.000 | 0.010 | 0.016 | 0.026 | 0.024 | 0.024 | 0.015 | 0.048 | 0.067 | 0.029 | 0.032 | 0.004 | 0.043 | 0.014 | 0.042 | 1.000 | 0.038 | 0.020 | 0.006 | 0.006 | 0.028 | 0.027 | 0.063 | 0.009 | 0.006 | 0.002 | 0.009 | 0.006 | 0.023 | 0.005 | 0.013 | 0.004 | 0.005 | 0.064 | 0.027 | 0.026 | 0.707 | 0.000 | 0.021 | 0.000 |
| full_or_part_time_employment_stat | 0.315 | 0.030 | 0.034 | 0.046 | 0.363 | 0.062 | 0.059 | 0.045 | 0.180 | 0.215 | 0.360 | 0.360 | 0.015 | 0.279 | 0.073 | 0.221 | 0.038 | 1.000 | 0.033 | 0.022 | 0.553 | 0.360 | 0.359 | 0.189 | 0.148 | 0.449 | 0.553 | 0.458 | 0.165 | 0.309 | 0.131 | 0.022 | 0.077 | 0.135 | 0.104 | 0.151 | 0.277 | 0.304 | 0.057 | 0.325 | 0.793 |
| hispanic_origin | 0.051 | 0.012 | 0.006 | 0.397 | 0.047 | 0.532 | 0.542 | 0.489 | 0.081 | 0.055 | 0.052 | 0.063 | 0.007 | 0.102 | 0.022 | 0.063 | 0.020 | 0.033 | 1.000 | 0.051 | 0.041 | 0.045 | 0.054 | 0.057 | 0.045 | 0.042 | 0.033 | 0.031 | 0.057 | 0.036 | 0.034 | 0.153 | 0.020 | 0.052 | 0.013 | 0.070 | 0.080 | 0.072 | 0.009 | 0.026 | 0.042 |
| instance_weight | 0.004 | 0.003 | 0.009 | 0.044 | 0.018 | 0.044 | 0.043 | 0.032 | 0.033 | 0.037 | 0.018 | 0.017 | 0.011 | 0.021 | 0.022 | 0.017 | 0.006 | 0.022 | 0.051 | 1.000 | 0.032 | 0.016 | 0.015 | 0.023 | 0.016 | 0.027 | 0.028 | 0.017 | 0.037 | 0.037 | 0.016 | 0.083 | 0.016 | 0.029 | 0.036 | 0.012 | 0.043 | 0.020 | 0.018 | 0.025 | 0.030 |
| live_in_this_house_1_year_ago | 0.115 | 0.008 | 0.004 | 0.031 | 0.037 | 0.037 | 0.036 | 0.033 | 0.073 | 0.073 | 0.036 | 0.036 | 0.009 | 0.024 | 0.019 | 0.027 | 0.006 | 0.553 | 0.041 | 0.032 | 1.000 | 0.034 | 0.029 | 0.061 | 0.010 | 0.992 | 0.818 | 1.000 | 0.707 | 0.035 | 0.049 | 0.045 | 0.033 | 0.707 | 0.006 | 0.029 | 0.046 | 0.019 | 0.008 | 0.041 | 0.986 |
| major_industry_code | 0.244 | 0.047 | 0.037 | 0.083 | 0.655 | 0.034 | 0.034 | 0.039 | 0.267 | 0.220 | 0.915 | 0.561 | 0.013 | 0.319 | 0.133 | 0.276 | 0.028 | 0.360 | 0.045 | 0.016 | 0.034 | 1.000 | 0.588 | 0.195 | 0.259 | 0.028 | 0.026 | 0.035 | 0.036 | 0.402 | 0.208 | 0.052 | 0.154 | 0.027 | 0.293 | 0.276 | 0.488 | 0.387 | 0.066 | 0.306 | 0.007 |
| major_occupation_code | 0.243 | 0.064 | 0.040 | 0.093 | 0.549 | 0.049 | 0.048 | 0.055 | 0.268 | 0.223 | 0.599 | 1.000 | 0.014 | 0.373 | 0.118 | 0.276 | 0.027 | 0.359 | 0.054 | 0.015 | 0.029 | 0.588 | 1.000 | 0.196 | 0.246 | 0.024 | 0.023 | 0.034 | 0.030 | 0.378 | 0.208 | 0.057 | 0.152 | 0.022 | 0.337 | 0.365 | 0.491 | 0.387 | 0.067 | 0.304 | 0.004 |
| marital_stat | 0.425 | 0.032 | 0.045 | 0.102 | 0.212 | 0.086 | 0.083 | 0.070 | 0.487 | 0.419 | 0.195 | 0.203 | 0.023 | 0.299 | 0.199 | 0.350 | 0.063 | 0.189 | 0.057 | 0.023 | 0.061 | 0.195 | 0.196 | 1.000 | 0.095 | 0.041 | 0.031 | 0.060 | 0.055 | 0.189 | 0.074 | 0.082 | 0.038 | 0.037 | 0.163 | 0.194 | 0.719 | 0.447 | 0.040 | 0.199 | 0.000 |
| member_of_a_labor_union | 0.172 | 0.019 | 0.020 | 0.013 | 0.278 | 0.043 | 0.042 | 0.028 | 0.105 | 0.128 | 0.261 | 0.261 | 0.005 | 0.148 | 0.025 | 0.128 | 0.009 | 0.148 | 0.045 | 0.016 | 0.010 | 0.259 | 0.246 | 0.095 | 1.000 | 0.011 | 0.009 | 0.024 | 0.009 | 0.227 | 0.068 | 0.022 | 0.041 | 0.011 | 0.030 | 0.074 | 0.166 | 0.126 | 0.351 | 0.221 | 0.000 |
| migration_code_change_in_msa | 0.073 | 0.014 | 0.002 | 0.092 | 0.029 | 0.053 | 0.053 | 0.059 | 0.062 | 0.051 | 0.031 | 0.029 | 0.007 | 0.020 | 0.021 | 0.022 | 0.006 | 0.449 | 0.042 | 0.027 | 0.992 | 0.028 | 0.024 | 0.041 | 0.011 | 1.000 | 0.856 | 0.793 | 0.706 | 0.025 | 0.050 | 0.049 | 0.024 | 0.629 | 0.005 | 0.031 | 0.050 | 0.020 | 0.005 | 0.030 | 0.981 |
| migration_code_change_in_reg | 0.066 | 0.006 | 0.002 | 0.021 | 0.021 | 0.029 | 0.029 | 0.028 | 0.047 | 0.048 | 0.028 | 0.026 | 0.006 | 0.029 | 0.014 | 0.016 | 0.002 | 0.553 | 0.033 | 0.028 | 0.818 | 0.026 | 0.023 | 0.031 | 0.009 | 0.856 | 1.000 | 1.000 | 0.447 | 0.029 | 0.041 | 0.041 | 0.033 | 0.467 | 0.005 | 0.014 | 0.025 | 0.016 | 0.008 | 0.040 | 0.986 |
| migration_code_move_within_reg | 0.086 | 0.014 | 0.002 | 0.092 | 0.052 | 0.040 | 0.040 | 0.045 | 0.089 | 0.065 | 0.036 | 0.036 | 0.006 | 0.074 | 0.027 | 0.074 | 0.009 | 0.458 | 0.031 | 0.017 | 1.000 | 0.035 | 0.034 | 0.060 | 0.024 | 0.793 | 1.000 | 1.000 | 0.743 | 0.043 | 0.057 | 0.044 | 0.028 | 0.709 | 0.009 | 0.038 | 0.092 | 0.112 | 0.003 | 0.038 | 1.000 |
| migration_prev_res_in_sunbelt | 0.108 | 0.008 | 0.004 | 0.029 | 0.036 | 0.047 | 0.048 | 0.042 | 0.069 | 0.068 | 0.039 | 0.037 | 0.007 | 0.023 | 0.020 | 0.021 | 0.006 | 0.165 | 0.057 | 0.037 | 0.707 | 0.036 | 0.030 | 0.055 | 0.009 | 0.706 | 0.447 | 0.743 | 1.000 | 0.028 | 0.046 | 0.025 | 0.033 | 0.873 | 0.005 | 0.029 | 0.043 | 0.007 | 0.008 | 0.041 | 0.295 |
| num_persons_worked_for_employer | 0.226 | 0.113 | 0.097 | 0.046 | 0.510 | 0.043 | 0.041 | 0.036 | 0.261 | 0.228 | 0.404 | 0.395 | 0.147 | 0.286 | 0.071 | 0.283 | 0.023 | 0.309 | 0.036 | 0.037 | 0.035 | 0.402 | 0.378 | 0.189 | 0.227 | 0.025 | 0.029 | 0.043 | 0.028 | 1.000 | 0.220 | 0.048 | 0.057 | 0.020 | 0.104 | 0.235 | 0.520 | 0.405 | 0.227 | 0.876 | 0.031 |
| own_business_or_self_employed | 0.192 | 0.027 | 0.023 | 0.024 | 0.208 | 0.049 | 0.046 | 0.025 | 0.095 | 0.136 | 0.208 | 0.214 | 0.012 | 0.156 | 0.070 | 0.125 | 0.005 | 0.131 | 0.034 | 0.016 | 0.049 | 0.208 | 0.208 | 0.074 | 0.068 | 0.050 | 0.041 | 0.057 | 0.046 | 0.220 | 1.000 | 0.033 | 0.042 | 0.049 | 0.047 | 0.083 | 0.184 | 0.125 | 0.020 | 0.240 | 0.012 |
| race | 0.056 | 0.011 | 0.010 | 0.240 | 0.047 | 0.440 | 0.443 | 0.380 | 0.075 | 0.072 | 0.057 | 0.071 | 0.010 | 0.068 | 0.026 | 0.099 | 0.013 | 0.022 | 0.153 | 0.083 | 0.045 | 0.052 | 0.057 | 0.082 | 0.022 | 0.049 | 0.041 | 0.044 | 0.025 | 0.048 | 0.033 | 1.000 | 0.022 | 0.045 | 0.022 | 0.060 | 0.108 | 0.056 | 0.009 | 0.040 | 0.050 |
| reason_for_unemployment | 0.080 | 0.004 | 0.004 | 0.027 | 0.436 | 0.019 | 0.018 | 0.024 | 0.040 | 0.063 | 0.155 | 0.163 | 0.000 | 0.064 | 0.087 | 0.045 | 0.004 | 0.077 | 0.020 | 0.016 | 0.033 | 0.154 | 0.152 | 0.038 | 0.041 | 0.024 | 0.033 | 0.028 | 0.033 | 0.057 | 0.042 | 0.022 | 1.000 | 0.024 | 0.047 | 0.028 | 0.077 | 0.069 | 0.011 | 0.113 | 0.014 |
| region_of_previous_residence | 0.069 | 0.014 | 0.000 | 0.092 | 0.027 | 0.062 | 0.062 | 0.066 | 0.060 | 0.048 | 0.030 | 0.027 | 0.007 | 0.017 | 0.021 | 0.018 | 0.005 | 0.135 | 0.052 | 0.029 | 0.707 | 0.027 | 0.022 | 0.037 | 0.011 | 0.629 | 0.467 | 0.709 | 0.873 | 0.020 | 0.049 | 0.045 | 0.024 | 1.000 | 0.007 | 0.030 | 0.044 | 0.008 | 0.003 | 0.028 | 0.295 |
| sex | 0.061 | 0.058 | 0.073 | 0.010 | 0.123 | 0.025 | 0.024 | 0.028 | 0.057 | 0.372 | 0.301 | 0.395 | 0.011 | 0.068 | 0.014 | 0.038 | 0.064 | 0.104 | 0.013 | 0.036 | 0.006 | 0.293 | 0.337 | 0.163 | 0.030 | 0.005 | 0.005 | 0.009 | 0.005 | 0.104 | 0.047 | 0.022 | 0.047 | 0.007 | 1.000 | 0.159 | 0.033 | 0.072 | 0.039 | 0.117 | 0.000 |
| target | 0.242 | 0.318 | 0.172 | 0.038 | 0.235 | 0.071 | 0.070 | 0.059 | 0.195 | 0.225 | 0.280 | 0.437 | 0.146 | 0.377 | 0.065 | 0.156 | 0.027 | 0.151 | 0.070 | 0.012 | 0.029 | 0.276 | 0.365 | 0.194 | 0.074 | 0.031 | 0.014 | 0.038 | 0.029 | 0.235 | 0.083 | 0.060 | 0.028 | 0.030 | 0.159 | 1.000 | 0.217 | 0.141 | 0.072 | 0.268 | 0.015 |
| tax_filer_stat | 0.585 | 0.059 | 0.095 | 0.055 | 0.486 | 0.077 | 0.072 | 0.058 | 0.538 | 0.662 | 0.488 | 0.497 | 0.037 | 0.545 | 0.173 | 0.531 | 0.026 | 0.277 | 0.080 | 0.043 | 0.046 | 0.488 | 0.491 | 0.719 | 0.166 | 0.050 | 0.025 | 0.092 | 0.043 | 0.520 | 0.184 | 0.108 | 0.077 | 0.044 | 0.033 | 0.217 | 1.000 | 0.503 | 0.079 | 0.531 | 0.000 |
| veterans_benefits | 0.644 | 0.037 | 0.054 | 0.084 | 0.389 | 0.080 | 0.076 | 0.084 | 0.504 | 0.603 | 0.387 | 0.387 | 0.025 | 0.707 | 0.102 | 0.628 | 0.707 | 0.304 | 0.072 | 0.020 | 0.019 | 0.387 | 0.387 | 0.447 | 0.126 | 0.020 | 0.016 | 0.112 | 0.007 | 0.405 | 0.125 | 0.056 | 0.069 | 0.008 | 0.072 | 0.141 | 0.503 | 1.000 | 0.056 | 0.395 | 0.003 |
| wage_per_hour | 0.039 | 0.005 | 0.005 | 0.017 | 0.083 | 0.005 | 0.007 | 0.007 | 0.052 | 0.036 | 0.067 | 0.077 | -0.000 | 0.051 | 0.023 | 0.043 | 0.000 | 0.057 | 0.009 | 0.018 | 0.008 | 0.066 | 0.067 | 0.040 | 0.351 | 0.005 | 0.008 | 0.003 | 0.008 | 0.227 | 0.020 | 0.009 | 0.011 | 0.003 | 0.039 | 0.072 | 0.079 | 0.056 | 1.000 | 0.218 | 0.007 |
| weeks_worked_in_year | 0.269 | 0.130 | 0.105 | 0.035 | 0.448 | 0.030 | 0.028 | 0.021 | 0.276 | 0.221 | 0.305 | 0.310 | 0.152 | 0.287 | 0.184 | 0.283 | 0.021 | 0.325 | 0.026 | 0.025 | 0.041 | 0.306 | 0.304 | 0.199 | 0.221 | 0.030 | 0.040 | 0.038 | 0.041 | 0.876 | 0.240 | 0.040 | 0.113 | 0.028 | 0.117 | 0.268 | 0.531 | 0.395 | 0.218 | 1.000 | 0.008 |
| year | 0.009 | 0.006 | 0.003 | 0.012 | 0.003 | 0.030 | 0.030 | 0.024 | 0.005 | 0.006 | 0.007 | 0.006 | 0.000 | 0.009 | 0.003 | 0.006 | 0.000 | 0.793 | 0.042 | 0.030 | 0.986 | 0.007 | 0.004 | 0.000 | 0.000 | 0.981 | 0.986 | 1.000 | 0.295 | 0.031 | 0.012 | 0.050 | 0.014 | 0.295 | 0.000 | 0.015 | 0.000 | 0.003 | 0.007 | 0.008 | 1.000 |
Missing values
Sample
| age | class_of_worker | detailed_industry_recode | detailed_occupation_recode | education | wage_per_hour | enroll_in_edu_inst_last_wk | marital_stat | major_industry_code | major_occupation_code | race | hispanic_origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | dividends_from_stocks | tax_filer_stat | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veteran's_admin | veterans_benefits | weeks_worked_in_year | year | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 73 | Not in universe | Not in universe or children | Not in universe | High School Graduate | 0 | Not in universe | Widowed | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 1700.09 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 |
| 1 | 58 | Self-employed | Manufacturing-durable goods | Automobile mechanics and repairers | Some College | 0 | Not in universe | Divorced | Construction | Precision production craft & repair | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Individual Filer | South | Arkansas | Primary Householder | Householder | 1053.55 | MSA movement | Same area | Same county | No | Yes | 1 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1994 | 1 |
| 2 | 18 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | High school | Never Married | Not in universe or children | Not in universe | Asian or Pacific Islander | All other | Female | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child 18 or older | 991.95 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Not in universe | Vietnam | Vietnam | Vietnam | Foreign | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 |
| 3 | 9 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1758.14 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 |
| 4 | 10 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1069.16 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 |
| 5 | 48 | Private sector | Personal services | Teachers, except college and university | Some College | 1200 | Not in universe | Married | Entertainment | Professional specialty | Amer Indian Aleut or Eskimo | All other | Female | No | Not in universe | FTE | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Spouse of householder | 162.61 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 1 | Not in universe | Philippines | United-States | United-States | Native | No | Not in universe | Not a Veteran | 52 | 1995 | 1 |
| 6 | 42 | Private sector | Manufacturing-durables | Management related occupations | College Graduate | 0 | Not in universe | Married | Finance insurance and real estate | Executive admin and managerial | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 5178 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Householder | 1535.86 | No movement | Same area | Nonmover | Yes | Not in universe | 6 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1994 | 1 |
| 7 | 28 | Private sector | Manufacturing-durable goods | Fabricators, assemblers, and hand working | High School Graduate | 0 | Not in universe | Never Married | Construction | Handlers equip cleaners etc | White | All other | Female | Not in universe | Job loser | FTE | 0 | 0 | 0 | Individual Filer | Not in universe | Not in universe | Other | Nonrelative of householder | 898.83 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 4 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 30 | 1995 | 1 |
| 8 | 47 | Government | Manufacturing | Food service occupations | Some College | 876 | Not in universe | Married | Education | Adm support including clerical | White | All other | Female | No | Not in universe | FTE | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Spouse of householder | 1661.53 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 5 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1995 | 1 |
| 9 | 34 | Private sector | Manufacturing-durable goods | Extractive occupations | Some College | 0 | Not in universe | Married | Construction | Machine operators assmblrs & inspctrs | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Householder | 1146.79 | No movement | Same area | Nonmover | Yes | Not in universe | 6 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1994 | 1 |
| age | class_of_worker | detailed_industry_recode | detailed_occupation_recode | education | wage_per_hour | enroll_in_edu_inst_last_wk | marital_stat | major_industry_code | major_occupation_code | race | hispanic_origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | dividends_from_stocks | tax_filer_stat | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veteran's_admin | veterans_benefits | weeks_worked_in_year | year | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 199513 | 57 | Private sector | Wholesale trade | Extractive occupations | Below High School | 0 | Not in universe | Divorced | Manufacturing-durable goods | Machine operators assmblrs & inspctrs | White | Central or South American | Female | Not in universe | Not in universe | FTE | 0 | 0 | 0 | Individual Filer | Not in universe | Not in universe | Primary Householder | Householder | 743.66 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 4 | Not in universe | Dominican-Republic | Dominican-Republic | Dominican-Republic | Foreign | Not in universe | Not in universe | Not a Veteran | 52 | 1995 | 1 |
| 199514 | 51 | Private sector | Public administration | Computer equipment operators | Below High School | 0 | Not in universe | Widowed | Retail trade | Sales | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Individual Filer | South | North Dakota | Primary Householder | Householder | 1302.34 | Non-MSA movement | Same area | Same county | No | Yes | 6 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1994 | 1 |
| 199515 | 87 | Not in universe | Not in universe or children | Not in universe | High School Graduate | 0 | Not in universe | Widowed | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Individual Filer | Not in universe | Not in universe | Primary Householder | Householder | 3255.80 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Not in universe | ? | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 |
| 199516 | 3 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | Black | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | South | Utah | Child | Nonrelative of householder | 2733.75 | MSA movement | Same area | Same county | No | Yes | 0 | Mother only present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 |
| 199517 | 39 | Private sector | Manufacturing | Food service occupations | College Graduate | 0 | Not in universe | Never Married | Education | Adm support including clerical | Other | Mexican-American | Male | No | Not in universe | FTE | 6849 | 0 | 0 | Individual Filer | Not in universe | Not in universe | Primary Householder | Householder | 908.14 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 6 | Not in universe | Mexico | Mexico | Mexico | Foreign | No | Not in universe | Not a Veteran | 52 | 1995 | 1 |
| 199518 | 87 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Joint Filer | Not in universe | Not in universe | Primary Householder | Householder | 955.27 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Not in universe | Canada | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 |
| 199519 | 65 | Self-employed | Wholesale and retail trade | Other executive, admin and managerial | Below High School | 0 | Not in universe | Married | Business and repair services | Executive admin and managerial | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 6418 | 0 | 9 | Joint Filer | Not in universe | Not in universe | Primary Householder | Householder | 687.19 | No movement | Same area | Nonmover | Yes | Not in universe | 1 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 52 | 1994 | 1 |
| 199520 | 47 | Not in universe | Not in universe or children | Not in universe | Some College | 0 | Not in universe | Married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 157 | Joint Filer | Not in universe | Not in universe | Primary Householder | Householder | 1923.03 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 6 | Not in universe | Poland | Poland | Germany | Naturalized | Not in universe | Not in universe | Not a Veteran | 52 | 1995 | 1 |
| 199521 | 16 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | High school | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 4664.87 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 |
| 199522 | 32 | Private sector | Public administration and armed forces | Farm operators and managers | High School Graduate | 0 | Not in universe | Never Married | Medical except hospital | Other service | Black | All other | Female | No | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Individual Filer | Not in universe | Not in universe | Primary Householder | Householder | 1830.11 | No movement | Same area | Nonmover | Yes | Not in universe | 6 | Not in universe | ? | ? | ? | Foreign | Not in universe | Not in universe | Not a Veteran | 52 | 1994 | 1 |
Duplicate rows
Most frequently occurring
| age | class_of_worker | detailed_industry_recode | detailed_occupation_recode | education | wage_per_hour | enroll_in_edu_inst_last_wk | marital_stat | major_industry_code | major_occupation_code | race | hispanic_origin | sex | member_of_a_labor_union | reason_for_unemployment | full_or_part_time_employment_stat | capital_gains | capital_losses | dividends_from_stocks | tax_filer_stat | region_of_previous_residence | state_of_previous_residence | detailed_household_and_family_stat | detailed_household_summary_in_household | instance_weight | migration_code_change_in_msa | migration_code_change_in_reg | migration_code_move_within_reg | live_in_this_house_1_year_ago | migration_prev_res_in_sunbelt | num_persons_worked_for_employer | family_members_under_18 | country_of_birth_father | country_of_birth_mother | country_of_birth_self | citizenship | own_business_or_self_employed | fill_inc_questionnaire_for_veteran's_admin | veterans_benefits | weeks_worked_in_year | year | target | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 14 | 15 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1217.42 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1994 | 1 | 3 |
| 79 | 17 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | High school | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 1724.96 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not a Veteran | 0 | 1994 | 1 | 3 |
| 0 | 0 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 1706.01 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Mother only present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1995 | 1 | 2 |
| 1 | 1 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | Black | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Other | Nonrelative of householder | 4118.09 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Not in universe | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 | 2 |
| 2 | 1 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | Mexican (Mexicano) | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 1231.01 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | Mexico | Mexico | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 | 2 |
| 3 | 3 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 1875.27 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Both parents present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 | 2 |
| 4 | 4 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 1332.16 | No movement | Same area | Nonmover | Yes | Not in universe | 0 | Neither parent present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1994 | 1 | 2 |
| 5 | 5 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | Asian or Pacific Islander | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 753.84 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Both parents present | Philippines | Philippines | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1995 | 1 | 2 |
| 6 | 5 | Not in universe | Not in universe or children | Not in universe | Children | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Extended Family | Other relative of householder | 1175.34 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Neither parent present | United-States | United-States | United-States | Native | Not in universe | Not in universe | Not in universe | 0 | 1995 | 1 | 2 |
| 7 | 15 | Not in universe | Not in universe or children | Not in universe | Below High School | 0 | Not in universe | Never Married | Not in universe or children | Not in universe | Asian or Pacific Islander | All other | Female | Not in universe | Not in universe | Not Employed | 0 | 0 | 0 | Non-Filer | Not in universe | Not in universe | Child | Child under 18 never married | 709.34 | Not in universe | Not in universe | ? | Not in universe | Not in universe | 0 | Both parents present | ? | ? | ? | Foreign | Not in universe | Not in universe | Not a Veteran | 0 | 1995 | 1 | 2 |